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ENZYMATIC DNA MOLECULES 



JPPHMir AL FIELD 

. The present invention relates to nucleic acid enzymes or catalytic (enzymatic) 
DNA molecules that are capable of cleaving other nucleic acid molecules, particularly 
RNA. The present invention also relates to compositions containing the disclosed 
enzymatic DNA molecules and to methods of making and using such enzymes and 
compositions. 

RADffiROUND 

The need for catalysis that operate outside of their native context or which 
catalyze'reactions that are not represented in nature has resu.ted in the development of 
■enzyme engineering" technology. The usual route taken in enzyme engineering has 
been a "rations, design' approach, relying upon the understanding of natural enzymes to 
aid in the construction of new enzymes. Unfortunately, the state of proficiency in the 
areas of protein structure and chemistry is insufficient to make the generetion of novel 

biological catalysts routine. 

Recently, a different approach for developing novel catalysts has been applied. 
This method involves the construction of a heterogeneous pool of macromolecu.es and 
the application of an in vitro selection procedure to isolate mo.ecu.es from the poo. that 
catalyze the desired reaction. Selecting cata.ysts from a poo. of macromo.ecules is not 
dependent on a comprehensive understanding of their structural and chemical 
properties. Accordingly, this process has been dubbed -irrational design" (Brenner and 
Lerner, pnas USA 89: 5381-5383 (1992)). 

Most efforts to date involving the rational design of enzymatic RNA molecules or 
ribozymes have not led to molecules with fundamentally new or improved cataiytic 
function. However, the application of irrational design methods via a process we have 
described as "directed molecular evolution" or 'in vitro evolution", which is patterned 
after Darwinian evolution of organisms in nature, has the potential to lead to the 
production of DNA molecules that have desirable funotiona. characterist.es. 

This technique has been applied with vexing degrees of success to RNA 
m0 lecu.es in solution (see. e.g.. Mi.is. et a... PNAS , USA 59 : 217 (1967): Green, et a... 
M 347 : 406 (1990); Chowrira, et a.., MaUit^: 320 (1991); Joyce. (texJU: 83 
(19891- Beaudry and Joyce. SfiisaCB 252: 635-641 (1992); Robertson and Joyce, 
Mature 344 : 467 (1990)). as well as to RNAs bound to a ligand that is attached to a 
solid support (Tuerk. et a.., SslsMS 249: 505 (1990); EUington. et a... Mttlilfi 346 : 818 
(19901). It has also been applied to peptides attached directly to e solid support (Lam. 
et al.. mm 354 : 82 11991)1; and to peptide epitopes expressed within a viral coat 
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protein {Scott, et al., ScisnCS 249: 386 (1990); Devlin, at al., Science ?4fl - 249 (1990); 
Cwirla, et ah, PNAS USA 87 : 6378 (1990)). 

It has been more than a decade since the discovery of catalytic RNA (Kruger, et 
al., Cell 31 : 147-157 (1982); Guerrier-Takada, et al., Cell 35 : 849-857 (1983)). The list 
of known naturally-occurring ribozymes continues to grow (see Cech, in The RNA Wn f |rj 
Gesteland & Atkins (eds.), pp. 239-269, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1993); Pyle, Science 251: 709-714 (1993); Symons, Curr. Ooin , 
Struct. BlflL 4: 322-330 (1994)) and, in recent years, has been augmented by synthetic 
ribozymes obtained through in vitro evolution. (See, e.g., Joyce, Curr. Onin, Struct. 
Bl0», 4: 331-336 (1994); Breaker & Joyce, Trends Biotech. 17 ? 268-275 (1994); 
Chapman & Szostak, Curr. Ooin. Struct. Binl. d- 618-622 (1994).) 

It seems reasonable to assume that DNA can have catalytic activity as well, 
considering that it contains most of the same functional groups as RNA. However, with 
the exception of certain viral genomes and replication intermediates, nearly all of the 
DNA in biological organisms occurs as a complete duplex, precluding it from adopting a 
complex secondary and tertiary structure. Thus it is not surprising that DNA enzymes 
have not been found in nature. 

Until the advent of the present invention, the design, synthesis and use of 
catalytic DNA molecules with nucfeotide-cleaving capabilities has not been disclosed or 
demonstrated. Therefore, the discoveries and inventions disclosed herein are . 
particularly significant* in that they highlight the potential of in vitro evolution as a 
means of designing increasingly more efficient catalytic molecules, including enzymatic 
DNA molecules that cleave other nucleic acids, particularly RNA. 

BRIEF SUMMAR Y OF THE INVENTION 
The present invention thus contemplates a synthetic or engineered (i.e., non- 
natura(ly-occurring) catalytic DNA molecule (or enzymatic DNA molecule) capable of 
cleaving a substrate nucleic acid (NA) sequence at a defined cleavage site. The 
invention also contemplates an enzymatic DNA molecule having an endonuclease 
activity. 

In one preferred variation, the endonuclease activity is specific for a nucleotide 
sequence defining a cleavage site comprising single-stranded nucleic acid in a substrate 
nucleic acid sequence. In another preferred variation, the cleavage site is double- 
stranded nucleic acid. Similarly, substrate nucleic acid sequences may be single- 
stranded, double-stranded, partially single- or double-stranded, looped, or any 
combination thereof. 

In another contemplated embodiment, the substrate nucleic acid sequence 
includes one or more nucleotide analogues. In one variation, the substrate nucleic acid 
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sequenca is a portion of, or attached to, a larger molecule. 

In various embodiments, the larger molecule is selected from the group 
consisting of RNA, modified RNA, DNA, modified DNA. nucleotide analogs, or 
composites thereof. In another example, the larger molecule comprises a composite of a 
5 nucleic acid sequence and a non-nucleic acid sequence. 

In another embodiment, the invention contemplates that a substrate nucleic acid 
sequence includes one or more nucleotide analogs. A further variation contemplates 
that the single stranded nucleic acid comprises RNA, DNA, modified RNA, modified 
DNA, one or more nucleotide analogs, or any composite thereof. In one embodiment of 

10 the disclosed invention, the endonudease activity comprises hydrolytic cleavage of a 

phosphoester bond at the cleavage site. 

In various preferred embodiments, the catalytic DNA molecules of the present 
invention are single-stranded in whole or in part. These catalytic DNA molecules may 
preferably assume a variety of shapes consistent with their catalytic activity. Thus, in 

1 5 one variation, a catalytic DNA molecule of the present invention includes one or more 

hairpin loop-structures. In yet another variation, a catalytic DNA molecule may assume 
a shape similar to that of "hammerhead- ribozymes. In still other embodiments, a 
catalytic DNA molecule may assume a conformation similar to that of Tetrahymena 
Thermophila ribozymes, e.g., those derived from group I introns. 

20 Similarly, preferred catalytic DNA molecules of the present invention are able to 

demonstrate site-specific endonudease activity irrespective of the original orientation of 
the substrate molecule. Thus, in one preferred embodiment, an enzymatic DNA 
molecule of the present invention is able to cleave a substrate nucleic acid sequence 
that is separate from the enzymatic DNA molecule - i.e., it is not linked to the 

25 DNAzyme. In another preferred embodiment, an enzymatic DNA molecule is able to 

cleave an attached substrate nucleic acid sequence - i.e., it is able to perform a reaction 
similar to self-cleavage. 

The invention also contemplates enzymatic DNA molecules (catalytic DNA 
molecules, deoxyribozymes or DNAzymes) having endonudease activity, whereby the 

30 endonudease activity requires the presence of a divalent cation. In various preferred, 

alternative embodiments, the divalent cation is selected from the group consisting of 
Pb 2 \ Mg 2+ , Mn 2 *, Zn 2 \ and Ca 2 *. Another variation contemplates that the 
endonudease activity requires the presence of a monovalent cation. In such alternative 
embodiments, the monovalent cation is preferably selected from the group consisting of 

35 Na* and K*\ 

In various preferred embodiments of the invention, an enzymatic DNA molecule 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NO 3, 



WO 96/17086 



PCT/US95/15580 



-4- 

SEQ ID NO 14; SEQ ID NO 15; SEQ ID NO 16; SEQ ID NO 17; SEQ ID NO 18; SEQ ID 
NO 1 9; SEQ ID NO 20; SEQ ID NO 21 ; and SEQ ID NO 22. In other preferred 
embodiments, a catalytic DNA molecule of the present invention comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NO 23; SEQ ID NO 24; SEQ ID r 
NO 25; SEO ID NO 26; SEQ ID NO 27; SEQ ID NO 28; SEQ ID NO 29; SEQ ID NO 30; 
SEQ ID NO 31; SEQ ID NO 32; SEQ ID NO 33; SEQ ID NO 34; SEQ ID NO 35; SEQ ID 
NO 36; SEQ ID NO 37; SEQ ID NO 38; and SEQ ID NO 39. 

Another preferred embodiment contemplates that a catalytic DNA molecule of 
the present invention comprises a nucleotide sequence selected from the group 
consisting of SEQ ID NO 50 and SEQ ID NO 51. In yet another preferred embodiment, a 
catalytic DNA molecule of the present invention comprises a nucleotide sequence 
selected from the group consisting of SEQ ID NOS 52 through 101. As disclosed 
herein, catalytic DNA molecules having sequences substantially similar to those 
disclosed herein are also contemplated. Thus, a wide variety of substitutions, deletions, 
insertions, duplications and other mutations may be made to the within-described 
molecules in order to generate a variety of other useful enzymatic DNA molecules; as 
long as said molecules display site-specific cleavage activity as disclosed herein, they 
are within the boundaries of this disclosure. 

In a further variation of the present invention, an enzymatic DNA molecule of the 
present invention preferably has a substrate binding affinity of about 1 ptA or less. In 
another embodiment, an enzymatic DNA molecule of the present invention binds 
substrate with a Ko of less than about 0.1 //M. 

The present invention also discloses enzymatic DNA molecules having useful 
turnover rates. In one embodiment, the turnover rate is less than 5 hr'; in a preferred 
embodiment, the rate is less than about 2 hr 1 ; in a more preferred embodiment, the rate 
is less than about 1hr\- in an even more preferred embodiment, the turnover rate is 
about 0.6 hr 1 or less. 

In still another embodiment, an enzymatic DNA molecule of the present 
invention displays a useful turnover rate wherein the k o6l is less than 1 min'\ preferably 
less than 0.1 min* 1 ; more preferably, less than 0.01 min' 1 ; and even more preferably, 
less than 0.005 min 1 . In one variation, the value of k obl is approximately 0.002 min"' or 
less. 

The present invention also contemplates embodiments in which the catalytic rate 
of the disclosed DNA enzymes is fully optimized. Thus, in various preferred 
embodiments, the K„, for reactions enhanced by the presence of Mg 2+ is approximately 
0.5-20 mM, preferably about 1-10 mM, and more preferably about 2-5 mM. 

The present invention also contemplates an embodiment whereby the nucleotide 
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sequence defining the cleavage site comprises at least one nucleotide. In various other 
preferred embodiments, a catalytic DNA molecule of the present invention is able to 
recognize and cleave a nucleotide sequence defining a cleavage site of two or more 
nucleotides. 

5 In various preferred embodiments, an enzymatic DNA molecule of the present 

invention comprises a conserved core flanked by one or more substrate binding regions. 
In one embodiment, an enzymatic DNA molecule includes first and second substrate 
binding regions. In another embodiment, an enzymatic DNA molecule includes two or 
more substrate binding regions. 
! o as noted previously, preferred catalytic DNA molecules of the present invention 

may also include a conserved core. In one preferred embodiment, the conserved core 
comprises one or more conserved regions. In other preferred variations, the one or more 
conserved regions include a nucleotide sequence selected from the group consisting of 
CG: CGA; AGCG; AGCCG; CAGCGAT; CTTGTTT; and CTTATTT (see, e.g.. Fig. 3). 
1 5 In one embodiment of the invention, an enzymatic- DNA- molecule of thr present 

invention further comprises one or more variable or spacer nucleotides between the 
conserved regions in the conserved core. In another embodiment, an enzymatic DNA 
molecule of the present invention further comprises one or more variable or spacer 
nucleotides between the conserved core and the substrate binding region. 
20 In one variation, the first substrate binding region preferably includes a 

nucleotide sequence selected from the group consisting of CATCTCT: GCTCT; 
TTGCTTTTT; TGTCTTCTC: TTGCTGCT; GCCATGCTTT ISEQ 10 NO 401; CTCTATTTCT 
(SEQ ID NO 41); GTCGGCA; CATCTCTTC; and ACTTCT. In another preferred variation, 
the second substrate binding region includes a nucleotide sequence selected from the 
25 group consisting of TATGTGACGCTA (SEQ ID NO 42); TATAGTCGTA (SEQ ID NO 43); 

ATAGCGTATTA (SEQ 10 NO 44); ATAGTTACGTCAT (SEQ ID NO 45); 
AATAGTGAAGTGTT (SEQ ID NO 46); TATAGTGTA; ATAGTCGGT; ATAGGCCCGGT 
(SEQ ID NO 47); AATAGTGAGGCTTG (SEQ ID NO 48): and ATGNTG. 

In various ernbodiments^l^res*,^ re 9 ions 
30 vary in length. Thus, for example, a substrate binding region may comprise a single 

nucleotide to dozens of nucleotides. However, it is understood that substrate b.nd.ng 
regions of about 3-25 nucleotides in length, preferably about 3-15 nucleotides in length, 
and more preferably about 3-10 nucleotides in length are particularly preferred. In 
various embodiments, the individual nucleotides in the substrate binding regions are able 
35 to form complementary base pairs with the nucleotides of the substrate molecules; m 

other embodiments, noncomplementary base pairs are formed. A mixture of 
complementary and noncomplementary base pairing is also contemplated as falling 
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within the scope of the disclosed embodiments of the invention. 

In another preferred embodiment, a cataiytic DNA molecule of the present 
invention may further comprise a third substrate binding region. In some preferred 
embodiments, the third region includes a nucleotide sequence selected from the group 
consisting of TGTT; TGTTA; and TGTTAG. Another preferred embodiment of the 
present invention discloses an enzymatic DNA molecule further comprising one or more 
variable or "spacer" regions between the substrate binding regions. 

in another disclosed embodiment, the present invention contemplates a purified, 
synthetic enzymatic DNA molecule separated from other DNA molecules' and 
oligonucleotides, the enzymatic DNA molecule having an endonuclease activity, wherein 
the endonuclease activity is specific for a nucleotide sequence defining a cleavage site 
comprising single- or double-stranded nucleic acid in a substrate nucleic acid sequence. 
In one variation, a synthetic (or engineered) enzymatic DNA molecule having an 
endonuclease activity is disclosed, wherein the endonuclease activity is specific for a 
nucleotide sequence- defining a cleavage site consisting essentially of a single- or double- 
stranded region of a substrate nucleic-acid sequence. 

In yet another embodiment, the invention contemplates an enzymatic DNA 
molecule comprising a deoxyribonucieotide polymer having a catalytic activity for 
hydrolyzing a nucleic acid-containing substrate to produce substrate cleavage products. 
In one variation, the hydrolysis takes place in a site-specific manner. As noted 
previously, the polymer may be single-stranded, double-stranded, or some combination 
of both. 

The invention further contemplates that the substrate comprises a nucleic acid 
sequence. In various embodiments, the nucleic acid sequence substrate comprises 
RNA, modified RNA, DNA. modified DNA, one or more nucleotide analogs, or 
composites of any of the foregoing. One embodiment contemplates that the substrate 
includes a single-stranded segment; still another embodiment contemplates that the 
substrate is double-stranded. 

Thespresent invention-aiso-.contemplates an enzymatic DNA molecule comprising 
a deoxyribonucieotide polymer having a catalytic activity for hydrolyzing a nucleic acid- 
containing substrate to produce a cleavage product. In one variation, the enzymatic 
DNA molecule has an effective binding affinity for the substrate and lacks an effective 
binding affinity for the cleavage product. 

In one preferred embodiment, the invention discloses a non-naturally-occurring 
enzymatic DNA molecule comprising a nucleotide sequence defining a conserved core 
flanked by recognition domains, variable regions, and spacer regions. Thus, in one 
preferred embodiment, the nucleotide sequenca defines a first variable region contiguous 
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or adjacent to the 5'-terminus of the molecule, a first recognition domain located 3'- 
terminai to the first variable region, a first spacer region located 3'-terminal to the first 
recognition domain, a first conserved region located 3*-terminal to the first spacer 
region, a second spacer region located 3'-terminal to the first conserved region, e 
5 second conserved region located 3'-terminal to the second spacer region, a second 

recognition domain located 3'-terrninal to the second conserved region, and a second 
variable region located 3'-terminal to the second recognition domain. 

In another embodiment, the nucleotide sequence preferably defines a first 
variable region contiguous or adjacent to the 5*-terminus of the molecule, a first 

10 recognition domain located 3'-terminal to the first variable region, a first spacer region 

located 3'*terminal to the first recognition domain, a first conserved region located 3'- 
terminal to the first spacer region, a second spacer region located 3*-terminal to the first 
conserved region, a second conserved region located 3'-terminal to the second spacer 
region, a second recognition domain located 3'-terminal to the second conserved region, 

15 a second variable region located 3*-terminal to the second recognition domain, and a 

third recognition domain located 3'-terminal to the second variable region. 

In one variation of the foregoing, the molecule includes a conserved core region 
flanked by two substrate binding domains; in another, the conserved core region 
comprises one or more conserved domains. In other preferred embodiments, the 

20 conserved core region further comprises one or more variable or spacer nucleotides. In 

yet another embodiment, an enzymatic DNA molecule of the present invention further 
comprises one or more spacer regions. 

The present invention further contemplates a wide variety of compositions. For 
example, compositions including an enzymatic DNA molecule as described hereinabove 

25 are disclosed and contemplated herein. In one alternative embodiment, a composition 

according to the present invention comprises two or more populations of enzymatic 
DNA molecules as described above, wherein each population of enzymatic DNA 
molecules is capable of cleaving a different sequence in a substrate. In another 
variation, a composition comprises two or more populations of enzymatic- DNA 

30 molecules as described hereinabove, wherein each population of enzymatic DNA 

molecules is capable of recognizing a different substrate. In various embodiments, it is 
also preferred that compositions include a monovalent or divalent cation. 

The present invention further contemplates methods of generating, selecting, 
and isolating enzymatic DNA molecules of the present invention. In one variation, a 

35 method of selecting enzymatic DNA molecules that cleave a nucleic acid sequence (e.g., 

RNA) at a specific site comprises the following steps: (al obtaining a population of 
putative enzymatic DNA molecules - whether the sequences are naturally-occurring or 
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synthetic - and preferably, they are single-stranded DNA molecules; (bl admixing 
nucleotide-containing substrate sequences with the aforementioned population of DNA 
molecules to form an admixture; (c) maintaining the admixture for a sufficient period of 
time and under predetermined reaction conditions to allow the putative enzymatic DNA 
molecules in the population to cause cleavage of the substrate sequences, thereby 
producing substrate cleavage products; (d) separating the population of DNA molecules 
from the substrate sequences and substrate cleavage products; and (e} isolating DNA 
molecules that cleave substrate nucleic acid sequences (e.g., RNA) at a specific site 
from the population. 

In a further variation of the foregoing method, the DNA molecules that cleave 
substrate nucleic acid sequences at a specific site are tagged with an immobilizing 
agent. In one example, the agent comprises biotin. 

In yet another variation of the aforementioned method, one begins by selecting a 
sequence - e.g., a predetermined "target" nucleotide sequence - that one wishes to 
cleave using an enzymatic DNA molecule engineered for that purpose. Thus, in one 
embodiment, the pre-selected (or predetermined) "target" sequence is used to generate 
a population of DNA molecules capable of cleaving substrate nucleic acid sequences at 
a specific site via attaching or "tagging" it to a deoxyribonucleic acid sequence 
containing one or more randomized sequences or segments. In one variation, the 
randomized sequence is about 40 nucleotides in length; in another variation, the 
randomized sequence is about 50 nucleotides in length. Randomized sequences that are 
1-40, 40-50, and 50-100 nucleotides in length are also contemplated by the present 
invention. 

In one embodiment of the present invention, the nucleotide sequence used to 
generate a population of enzymatic DNA molecules is selected from the group consisting 
of SEQ ID NO 4, 23, 50 AND 51 . In another embodiment, the 'target" or "substrate" 
nucleotide sequence comprises a sequence of one or more ribonucleotides - see, e.g., 
the relevant portions of SEQ ID NOS 4 and 23, and SEQ ID NO 49. It is also 
contemplated by the present invention that a useful "target* or "sxjbsjr^e^nucieotide- 
sequence may comprise DNA, RNA, or a composite thereof. 

The invention also contemplates methods as described above, wherein the 
isolating step further comprises exposing the tagged DNA molecules to a solid surface 
having avidin linked thereto, whereby the tagged DNA molecules become attached to 
the solid surface. As before, the substrate may be RNA, DNA. a composite of both, or 
a molecule including nucleotide sequences. 

The present invention also contemplates a method for specifically cleaving a 
substrate nucleic acid sequence at a particular cleavage site, comprising the steps of (a) 
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providing an enzymatic DNA molecule capable of cleaving a substrate nucleic acid 
sequence at a specific cleavage site: and (b) contacting the enzymatic ONA molecule 
with the substrate nucleic acid sequence to cause specific cleavage of the nucleic acid 
sequence at the cleavage site. In one variation, the enzymatic DNA molecule is e non- 
5 naturally-occurring (or syntheticl DNA molecule. In another variation, the enzymatic 

DNA molecule is single-strended. 

In still another veriation of the foregoing method, the substrate comprises a 
nucleic acid. In various embodiments, the substrate nucleic acid comprises RNA, 
modified RNA, DNA, modified DNA, one or more nucleotide analogs, or composites of 
1 0 any of the foregoing . In yet another embodiment, the specific cleavage is caused by the 

endonuclease activity of the enzymatic DNA molecule. Alteration of reaction conditions 
- e.g., the adjustment of pH, temperature, percent cation, percent enzyme, percent 
substrate, and percent product - is also contemplated herein. 

The present invention also contemplates a method of cleaving a phosphoester 
1 5 bond, comprising la) admixing an catalytic DNA molecule capable of cleaving a 

substrate nucleic-acid sequence at a defined cleavage site with a phosphoester bond- 
containing substrate, to form a reaction admixture; and (b) maintaining the admixture 
under predetermined reaction conditions to allow the enzymatic DNA molecule to cleave 
the phosphoester bond, thereby producing a population of substrate products. In one 
20 embodiment, the enzymatic DNA molecule is able to cleave the phosphoester bond in a 

site-specific manner. In another embodiment, the method further comprises the steps of 
(c) separating the products from the catalytic DNA molecule; and (d) adding additional 
substrate to the enzymatic DNA molecule to form a new reaction admixture. 

The present invention also contemplates methods of engineering enzymatic DNA 
25 molecules that cleave phosphoester bonds. One exemplary method comprises the 

following steps: (a> obtaining a population of single-stranded DNA molecules: (b) 
introducing genetic variation into the population to produce a variant population; (c) 
selecting individuals from the variant population that meet predetermined selection 
criteria: <d| separating the selected individuals from the remainder of the variant 
30 population; and le) amplifying the selected individuals. 

P PI pp nPsmip rrN n F T " p swings 
Figure 1 illustrates a selective amplification scheme for isolation of DNAs that 
cleave a target RNA phosphoester. As shown, double-stranded DNA that contains a 
stretch of 50 random nucleotides (the molecule with "N M " indicated above it) is 
35 amplified by PCR. employing a 5 '-biotinylated DNA primer that is terminated at the 3 ' 

end by an adenosine ribonucleotide (rA). (The biotin label is indicated via the encircled 
letter "BM This primer is extended by Taq polymerase to yield a DNA product that 
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contains a single embedded ribonucleotide. The resulting double-stranded DNA is 
immobilized on a streptavidin matrix and the unbiotinylated DNA strand is removed by 
washing with 0.2 N NaOH. After re-equilibrating the column with a buffered solution, 
the column is washed with the same solution with added 1 mM PbOAc. ONAs that 
undergo Pb 2 *-dependent self-cleavage are released from the column, collected in the 
eluant, and amplified by PCR. The PCR products are then used to initiate the next round 
of selective amplification. 

Figure 2 illustrates self-cleavage activity of the starting pool of DNA (GO) and 
populations obtained after the first through fifth rounds of selection IG1 - G5) f in the 
presence of lead cation (Pb 2 *). The symbol Pre represents 108-nucleotide precursor 
DNA (SEQ ID NO 4); Civ, 28-nucleotide 5 '-cleavage product (SEQ ID NO 51; and M, 
primer 3a (SEQ ID NO 6), which corresponds in length to the 5 '-cleavage product. 

Figure 3 illustrates the sequence alignment of individual variants isolated from 
the population after five rounds of selection. The fixed substrate domain is shown at 
the top, with the target riboadenylate identified via an inverted triangle. Substrate 
nucleotides that are commonly involved in presumed base-pairing interactions are 
indicated by vertical bars. Sequences corresponding to the 50 initially-randomized 
nucleotides are aligned antiparallel to the substrate domain. AH of the variants are 
3 '-terminated by the fixed sequence 5 '-CGGTAAGCTTGGCAC-3 ' (not shown; SEQ ID 
NO 1). Nucleotides within the initially-randomized region that are presumed to form 
base pairs with the substrate domain are indicated on the right and left sides of the 
Figure; the putative base-pair-forming regions of the enzymatic DNA molecules are 
individually boxed in each sequence shown. Conserved regions are illustrated via the 
two large, centrally-located boxes. 

Figures 4A and 4B illustrate DNA-catalyzed cleavage of an RNA phosphoester in 
an intermolecular reaction that proceeds with catalytic turnover. Figure 4A is a 
diagrammatic representation of the complex formed between the 19mer substrate (3'- 
TC ACTATrAGG AAGAG ATGG-5', SEQ ID NO 2) and 38mer DNA enzyme (5*- 
ACACATCTCTGAAGTAGCGCCGCCGTATAGTGACGCTA-3', SEQ ID NO 3|. The 
substrate contains a single adenosine ribonucleotide ("rA\ adjacent to the arrow), 
flanked by deoxyribonucleotides. The synthetic DNA enzyme is a 38-nucieotide portion 
of the most frequently occurring variant shown in Fig. 3. Highly-conserved nucleotides 
located within the putative catalytic domain are "boxed". As illustrated, one conserved 
sequence is "AGCG", while another is "CG" (reading in the 5'-3* direction). 

Figure 4B shows an Eadie-Hofstee plot used to determine K m (negative slope) 
and V^, (y-intercept) for DNA-catalyzed cleavage of (5 '-"Pl-labeled substrate under 
conditions identical to those employed during in vitro selection. Initial rates of cleavage 
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were determined for reactions involving 5 nM ONA enzyme and either 0.125. 0.5, 1, 2, 

or 4 nM substrate. 

Figure 5 is a photographic representation showing a polyacrylamida gel 
demonstrating specific endoribonuclease activity of four families of selected catalytic 
5 ONAs. Selection of a Pb~-dependent family of molecules was repeated in a side-by- 

side fashion as a control (first group). In the second group, Zn>* is used as the cation; 
in group three, the cation is Mn"; and in the fourth group, the cation is Mg 1 *. A fifth 
site on the gel consists of the cleavage product alone, as a marker. 

As noted, there are three lanes within each of the aforementioned four groups. 
1 0 In each group of three lanes, the first lane shows the lack of activity of the selected 

population in the absence of the metal cation, the second lane shows the observed 
activity in the presence of the metal cation, and the third lane shows the lack of activity 

of the starting pool (GO). 

Figures 6A and 6B provide two-dimensional illustrations of a "progenitor" 
1 5 catalytic DNA molecule and one of several catalytic DNA molecules-obtained. via the 

selective amplification methods disclosed herein, respectively. Figure 6A illustrates an 
exemplary molecule from the starting pool, showing the overall configuration of the 
molecules represented by SEQ ID NO 23. As illustrated, various complementary 
nucleotides flank the random (NJ region. Bgura 6B is a diagrammatic representation of 
70 one of the Mg'*-dependent catalytic DNA molecules (or "DNAz-ymes") generated via the 

within-described procedures. The location of the ribonucleotide in the substrate nucleic 
acid is indicated via the arrow in both Figs. 6A and 68. 

Figure 7 illustrates some of the results of ten rounds of in vitro selective 
amplification carried out essentia.ly as described in Example 5 hereinbelow. As shown, 
25 two sites and two families of catalysts emerged as displaying the most efficient 

cleavage of the target sequence. Cleavage conditions were essentially as indicated in 
Fig. 7, namely, 10mM Mg", pH 7.5. end 37'C; data co.lected after the reaction ran for 
2 hours is shown. Cleavage (%) is shown plotted against the number of generations 
(here 0 through 10). The nambe^^ieace^^ataiyticDNA. molecules^capable of 
30 cleaving the target sequence at the indicated sites in the substrata is illustrated via the 

vertical bars, with cleavage at Gl UAACUAGAGAU shown by the striped bars, and w,th 
cleavage at GUAACUAlGAGAU illustrated via the open (lightly-shaded) bars. 

Figure 8 illustrates the nucleotide sequences, cleavage sites, and turnover rates 
of two catalytic DNA molecules of the present Invention, clones 8-17 and 10-23. 
35 Reaction conditions were as shown, namely, 10mM Mg- pH 7.5. and 37"C. The 

ONAzyme identified as clone 8-17 is illustrated on the left, with the site of cleavage of 
the RNA substrate indicated by the arrow. The substrate sequence 15' - 
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GGAAAAAGUAACUAGAGAUGGAAG - 3') - which is separate from the DNAzyme (i.e., 
intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence is indicated. For the 8- 
17 enzyme, the turnover rate was approximately 0.6 hr 1 ; for the 10-23 enzyme, the 
turnover rate was approximately 1 hr'. Noncomplementary pairings are indicated with a 
closed circle (•), whereas complementary pairings are indicated with a vertical line f|). 

Figure 9 further illustrates the nucleotide sequences, cleavage sites, and 
turnover rates of two catalytic DNA molecules of the present invention, clones 8-17 and 
10-23. Reaction conditions were as shown, namely, 10mM Mg a \ pH 7.5, and 37°C. 
As in Fig. 8, the DNAzyme identified as clone 8-17 is illustrated on the left, with the site 
of cleavage of the RNA substrate indicated by the arrow. The substrate sequence (5* - 
GGAAAAAGUAACUAGAGAUGGAAG - 3') -which is separate from the DNAzyme (i.e., 
intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the^site-of-xleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence is indicated. For the 8- 
17 enzyme, was approximately 0.002 min 1 ; for the 10-23 enzyme, the value of k 0to4 
was approximately 0.01 min'\ Noncomplementary pairings are indicated with a closed 
circle {•), whereas complementary pairings are indicated with a vertical line (|). 

DETAILED DESCRIPTION 

a. Definitions 

As used herein, the term "deoxyribozyme" is used to describe a DNA-containing 
nucleic acid that is capable of functioning as an enzyme. In the present disclosure, the 
term "deoxyribozyme" includes endoribonucleases and endodeoxyribonucleases, 
although deoxyribozymes with endoribonuclease activity are particularly preferred. 
Other terms used interchangeably with deoxyribozyme herein are "enzymatic DNA 
molecule", "DNAzyme", or "catalytic DNA molecule", which terms should all be 
understood to include enzymatically active portions thereof, whether they are produced 
synthetically, or derived, from onganisms-or other sources. 

The term "enzymatic DNA molecules" also includes DNA molecules that have 
complementarity in a substrate-binding region to a specified oligonucleotide target or 
substrata; such molecules also have an enzymatic activity which is active to specifically 
cieave the oligonucleotide substrate. Stated in another fashion, the enzymatic DNA 
molecule is capable of cleaving the oligonucleotide substrate intermolecuiarly. This 
complementarity functions to allow sufficient hybridization of the enzymatic DNA 
molecule to the substrate oligonucleotide to allow the intermolecular cleavage of the 
substrate to occur. While one-hundred percent (100%) complementarity is preferred, 
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complementaritY in the range of 75-100% is also usefu. and contemplated by the 
present invention. 

Enzymatic ONA molecules of .he present invention may alternatively be 
described as having nuclease or ribonuclease activity. These terms may be used 
interchangeably herein. 

The term "enzymatic nucleic acid" as used herein encompasses enzymauc RNA 
or DNA molecu.es. enzymatic RNA-DNA po.ymers. and enzymatical.y active portions or 
derivatives thereof, a.though enzymatic ONA molecu.es are a particu.ar, preferred Cass 
of enzymatical, active molecu.es according to the present invents. ■ 

The term "endodeoxyribonuc.ease", as used herein, is an enzyme capable «f 
Ceaving a substrate comprised predominant, of ONA. The term "endoribonuclease as 
Z herein, is an enzyme capable of cleaving a substrate comprised predom.nant, of 
RNA 

' As used herein, the term "base pair" (bp) is genera.ly used to describe a 
partnership of adenine (A, with thymine ,T> or uracil <U, or of cytosine ,C, with guar™ 
(G) . enough it shou.d be appreciated that .ass-common ana.ogs of the bases A T C. 
and G (as we., as U> may occasional participate in base pairings. Nuc.eot.des th t 
normal pair up when DNA or RNA adopts a double stranded configurate may a.so be 
referred to herein as -complementary bases". 

. C omp.ementary nuc.eotide sequence" genera.ly refers to a sequence of 
nuclides in a sing.e-stranded mo.ecule or segment of ONA or RNA that is su « 
complementary to that on another sing, o.igonuc.eotide strand to specmcal, hybnd.ze 
to it with consequent hydrogen bonding. 

MuOeoL" genera., refers to a monomeric unit of ONA or BNA cons.st.ng of a 
sugar moiety .pentose,, a phosphate group, and a nitrogenous 

ba e is ft** to the sugar moiety via the g.ycosid, carbon ,V carbon of he pentose, 
. and that combination of base and sugar is a "nuc.eoside". When the 

contains a phosphate group bonded to the .■ or »■ position of the pentose ,t , re er red 
I! as a nucleotide. A sequence of operative, ,, k ed nucleotides is typ.ca,, referred 
. . . basa seqU ence" or "nucleotide sequence", and their grammafcal 

: :,: , rjz— — * . — »<• - - — - * * 
- - — • - *"* -~ ,h ; "irr^in 

-L,r p.n».... « • — " •< »• — A " "•"■^ 
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wherein the base has been altered is provided in section C hereinbelow. 

"Oligonucleotide or polynucleotide" generally refers to a polymer of single- or 
double-stranded nucleotides. As used herein, "oligonucleotide" and its grammatical 
equivalents will include the full range of nucleic acids. An oligonucleotide will typically 
5 refer to a nucleic acid molecule comprised of a linear strand of ribonucleotides. The 

exact size will depend on many factors, which in turn depends on the ultimate 
conditions of use, as is well known in the art. 

As used herein, the term "physiologic conditions" is meant to suggest reaction 
conditions emulating those found in mammalian organisms, particularly humans. While . 

10 variables such as temperature, availability of cations, and pH ranges may vary as 

described in greater detail below, "physiologic conditions" generally comprise a 
temperature of about 35-40 8 C, with 37°C being particularly preferred, as well as a pH 
of about 7.0-8.0, with 7.5 being particularly preferred, and further comprise the 
availability of cations, preferably divalent and/or monovalent cations, with a 

1 5 concentration of about 2-15 mM Mg 2 * and 0-t.O M Na+ being particularly preferred. 

"Physiologic conditions", as used herein, may optionally include the presence of free 
nucleoside cofactor. As noted previously, preferred conditions are described in greater 
detail below. 

B. gn?vmatic DNA Molecules 

20 In various embodiments, an enzymatic DNA molecule of the present invention 

may combine one or more modifications or mutations including additions, deletions, and 
substitutions. In alternative embodiments, such mutations or modifications may be 
generated using methods which produce random or specific mutations or modifications. 
These mutations may, for example, change the length of, or alter the nucleotide 

25 sequence of, a loop, a spacer region or the recognition sequence lor domain). One or 

more mutations within one cataiytically active enzymatic DNA molecule may be 
combined with the mutation(s) within a second cataiytically active enzymatic DNA 
molecule to produce a new enzymatic DNA molecule containing the mutations of both 
molecules. 

30 In other preferred embodiments, an enzymatic DNA molecule of the present 

invention may have random mutations introduced into it using a variety of methods well 
known to those skilled in the art. For example, the methods described by Cadweli and 
Joyce (PCR Methods and Applications 2 : 28-33 (1992)) are particularly preferred for use 
as disclosed herein, with some modifications, as described in the Examples that follow. 

35 (Also see Cadweli and Joyce, PCR Method s and Applications 3 (SupdI,): S1 36-S140 

{1994}.) According to this modified PCR method, random point mutations may be 
introduced into cloned genes. 
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Tha aforementioned methods have been used, for example, to mutagenize genes 
enooding ribozymes with a mutation rate of 0.66% ± 0.13% |9S% confidence interval, 
per position, as determined by sequence analysis, with no strong preferences observed 
' with respect to the type of base substitution. This allows the introduction of random 
5 mutations at any position in the enzymatic DNA molecules of the present invention. 

Another method useful in introducing defined or random mutations is disclosed 
in Joyce and Inoue. BSttflldl 12= 71 1-722 (1989). This latter method 

involves excision of a template (coding) strand of a double-stranded DNA, reconstruction 
of the template strand with inclusion of mutagenic oligonucleotides, and subsequent 
, 0 transcription of the partially-mismatched template, This allows the introduction of 

defined or random mutations at any position in the molecule by includ.ng 
polynucleotides containing known or random nucleotide sequences at selected pos,t,ons. 

Enzymatic DNA molecules of the present invention may be of varying lengths 
and folding patterns, as appropriate, depending on the type and function of the 
1 5 mo.ecu,e. For example, enzyrntfhrDNA-mo.ecu.es may be about 1 5 to about 400 or 

more nucleotides* length, although a .ength not exceeding about 250 nucleotides ,s 
preferred, to avoid limiting the therapeutic usefu.ness of molecules by making them too 
lar ge or unwieldy. In various preferred embodiments, an enzymatic ONA molecule of the 
present invention is at least about 20 nucleotides in length and. while useful molecules 
20 may exceed 100 nucleotides in length, preferred molecules are genera.ly not more than 

about 100 nucleotides in length. 

,„ various therapeutic applications, enzymatic DNA mo.ecu.es of the present 
invention comprise the enzvmatica.ly active portions of deoxyribozymes. In venous 
embodiments, enzymatic DNA molecules of the present invention preferably compnse 
25 not more than ebout 200 nucleotides. In other embodiments, a deoxyribozyme of the 

present invention comprises not more than about 100 nuc.eotides. In sti« other 
preferred embodiments, deoxyribozymes of the present invention are about 20-75 
nucleotides In length, more preferably about 20-65 nucleotides in .ength. Other 
preferred^enzymatic-DNA^olecules are about 10-50 nucleotides in length. 
30 In other epp.ications. enzymatic DNA molecules may assume configurates 

^ similar to those of -hammerhead" ribozymes. Such enzymatic DNA molecules are 

preferably no more than about 75-100 nucleotides in length, with e length of ebout 20- 
50 nucleotides being particularly preferred. 

|„ general, if one intends to synthesize moleou.es for use as disclosed herem. the 
35 lerger the enzymatic nuc.eic ecid molecule is. the more difficu.t it is to synthes.ze. 

Those of skill in the art will certainly appreciate these design constraints. Nevertheless, 
such larger molecules remein within the scope of the present invention. 
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It is also to be understood that an enzymatic DNA molecule of the present 
invention may comprise enzymatically active portions of a deoxyribozyme or may 
comprise a deoxyribozyme with one or more mutations, e.g.. with one or more base- 
pair-forming sequences or spacers absent or modified, as long as such deletions, 
5 additions or modifications do not adversely impact the molecule's ability to perform as 

an enzyme. 

The recognition domain of an enzymatic DNA molecule of the present invention 
typically comprises two nucleotide sequences flanking a catalytic domain, and typically 
contains a sequence of at least about 3 to about 30 bases, preferably about 6 to about 

10 15 bases, which are capable of hybridizing to a complementary sequence of bases 

within the substrate nucleic acid giving the enzymatic DNA molecule its high sequence 
specificity. Modification or mutation of the recognition site via well-known methods 
allows one to alter the sequence specificity of an enzymatic nucleic acid molecule. 
(See, e.g, Joyce et al.. Nucleic Acids Research 17 : 711-712 (1989.1) 

1 5 Enzymatic nucleic acid molecules of the present invention also include those 

with altered recognition sites or domains. In various embodiments, these altered 
recognition domains confer unique sequence specificities on the enzymatic nucleic acid 
molecule including such recognition domains. The exact bases present in the 
recognition domain determine the base sequence at which cleavage will take place. 

20 Cleavage of the substrate nucleic acid occurs within the recognition domain. This 

cleavage leaves a 2*, 3', or 2',3 , -cyclic phosphate group on the substrate cleavage 
sequence and a 5* hydroxy! on the nucleotide that was originally immediately 3* of the 
substrate cleavage sequence in the original substrate. Cleavage can be redirected to a 
site of choice by changing the bases present in the recognition sequence (internal guide 

25 sequence). See Murphy et al., Proc. Natl. Acad. Sci. USA 86 : 921 8-9222 (1 989). 

Moreover, it may be useful to add a polyamine to facilitate recognition and 
binding between the enzymatic DNA molecule and its substrate. Examples of useful 
polyamines include spermidine, putrescine or spermine. A spermidine concentration of 
about 1 mM may be effective in particular embodiments, while concentrations ranging 

30 from about 0.1 mM to about 10 mM may also be useful. 

In various alternative embodiments, an enzymatic DNA molecule of the present 
invention has an enhanced or optimized ability to cleave nucleic acid substrates, 
preferably RNA substrates. As those of skill in the art will appreciate, the rate of an 
enzyme-catalyzed reaction varies depending upon the substrate and enzyme 

35 concentrations and, in general, levels off at high substrate or enzyme concentrations. 

Taking such effects into account, the kinetics of an enzyme-catalyzed reaction may be 
described in the following terms, which define the reaction. 
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The enhanced or optimized ability of an enzymatic DNA molecule of the present 
invention to cleave an RNA substrate may be determined in a cleavage reaction with 
varying amounts of labeled RNA substrate in the presence of enzymatic DNA molecule. 
The ability to cleave the substrate is generally defined by the catalytic rate (k„,l divided 
5 by the Michae.is constant IKJ. The symbol k M , represents the maxima, velocity of an 

enzyme reaction when the substrate approaches a saturation value. K H represents the 
substrate concentration at which the reaction rate is one-half maximal. 

For example, values for K M and k el , may be determined in this invention by 
experiments in which the substrate concentration IS) is in excess over enzymatic DNA 
10 molecule concentration [El. Initial rates of reaction (v.) over a range of substrate 

concentrations are estimated from the initial .inear phase, genera.ly the first 5% or less 
of the reaction. Data points are fit by a .east squares method to e theoretical line given 
by the equation: v = « M (v„/[Sl> + V m „. Thus. and K M are determined by the .n.t.a. 
rate of reaction, v., and the substrate concentration IS]. 
, 5 ,n various a.ternative embodiments, an enzymatic DNA molecule of the present 

invention has an enhanced or optimized ability to cleave nucleic acid substrates, 
preferably RNA substrates. In preferred embodiments, the enhanced or optimized abdrty 
of an enzymatic DNA mo.ecu.e to cleave RNA substrates shows about a 10- to 10Mo.d 
improvement over the uncata.yzed rate. In more preferred embodiments, en enzymat.c 
20 DNA mo.ecu.e of the present invention is able to c.eave RNA substrates at a rate that « 

about 10 s - to lO'-lold improved over "progenitor" species. In even more preferred 
embodiments, the enhanced or optimized ability to c.eave RNA substrates is expressed 
as a 10*- to lOMold improvement over the progenitor species. One skilled in the art w,ll 
appreciate that the enhanced or optimized ability of an enzymatic DNA mo.ecu.e to 
25 cleave nucleic acid substrates may vary depending upon the selection constants 

applied during the in vitro evolution procedure of the invention. 

Various preferred methods of modifying deoxyribozymes and other enzymafc 
DNA mo.ecu.es and nuc.eases of the present invention are further described in Examples 
1-3 hereinbelow. 
30 C. Mufilflflflflg Analogs 

As noted above, the term "nuc.eotide analog" as used herein generally refers to 
a purine or pyrimidine nucleotide that differs structurally from A. T. G. C. or U. but « 
sufficiently similar to substitute for such "norma." nuc.eotides in a nucleic acid mo.ecu.e. 
As used herein, the term "nuc.eotide ana.og" encompasses a.tered bases, different (or 
35 unusual) sugars, a.tered phosphate backbones, or any combination of these a.terat.ons. 

Examp,es of nuc.eotide ana.ogs useful according to the present invention include those 
Hsted in the following Table, most of which are found in the approved listing of mod.hed 
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bases at 37 CFR 51 .822 (which is incorporated herein by reference). 

Table 1 

Nucleotide Analogs 



Abbreviation 



Description, 



10 



15 



20 



25 



30 



35 



ac4c 4-acetylcytidine 

chm5u ' 5-(carboxyhydroxylmethyl}uridine 

cm 2'-0-methy(cytidine 

cmnm5s2u 5-carboxymethylaminomethyl-2-thiouridine 

d dihydrouridine 

fm 2'-0-methylpseudouridine 

galq B, D-galactosylqueosine 

gm 2'-0-methylguanosine 

I incline 

i6a N6-isopentenyladenosine 

m1a 1-methyladenosine 

m1f 1-methylpseudouridine 

mlg 1-methylguanosine 

mil 1-methylinosine 

m22g 2,2-dimethylguanosine 

m2a 2-methyladenosine 

m2g 2-methylguanosine 

m3c 3-methylcytidine 

m5c 5-rnethylcytidine 

rn6a N6-methy I adenosine 

m7g 7-methylguanosine 

mam5u 5-methylaminomethyluridine 

mam5s2u 5-me^hoxYaTrrinornethyl-2-thiouridine 

manq G>, O-mannosylmethyluridine 

mcm5s2u 5-rnethoxycarbonylmethyluridine 

mo5u 5-methoxyuridine 

ms2i6a 2-methylthio-N6-isopentenyladenosine 

ms2t6a N-{(9-tt-0-ribofuranosy)-2-methylthiopurine-6- 

yOcarbamoyijThreonine 

mt6a N-{(9-&-D-ribofuranosylpurine-6-yl)N-methyl- 

carbamoyOthreonine 
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(Table 1, cont'd) 



fthhrRviation Description 

mv uridine-5-oxyacetic acid methylester 

o5 u uridine-5-oxyacetic acid |v) 

0S y W wybutoxosine 

p pseudouridine 

q queosine 

S 2c 2-thiocytidine 

S 2t 5-methyt-2-thiouridine 

s 2u 2-thiouridine 

S 4 U 4-thiouridine 



t 5-methy!uridine 

t6a N.((9_B-D-ribofuranosylpurine-6-yl)carbamoYl)thraoninetm 

2 , -0-methyl-5-methyluridine 

um 2*-0-methyluridine 

y W wybutosine 

x 3-<3-amino-3-carboxypropyl)uridine, (acp3)u 
araU D-arabinosyl 

araT D-arabinosyl ■ 



Other useful analogs include those described in published international 
application no. WO 92/20823 (the disclosures of which are incorporated herein by 
reference), or analogs made according to the methods disclosed therein. Analogs 
described in DeMesmaeker, et al., ftm-W r h em, InT Frf. Engl, 33: 226-229 (1994); 
DeMesmaeker, et al.. Svnlett : 733-736 (Oct. 1993); Nielsen, et al., SfiifiQCfi 254 : 1497- 
1500 (1991); and Idziak, et al., T^rlmn IftttOT 34: 5417-5420 (1993) are also 
useful according to the within-disclosed invention and said disclosures are incorporated 

by reference herein. 

0. M qtfrnHs of Ea a iPfiglina Fn7vmatir PNA Molflcules 

The present invention elso contemplates methods of producing nucleic acid 

molecules having a predetermined activity. In one preferred embodiment, the nucleic 

acid molecule is an enzymatic DNA molecule. In another variation, the desired activity is 

a catalytic activity. 

In one embodiment, the present invention contemplates methods of synthesizing 
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enzymatic DNA molecules that may then be "engineered" to catalyze a specific or 
predetermined reaction. Methods of preparing enzymatic DNA molecules are described 
herein; see, e.g.. Examples 1-3 hereinbelow. In other embodiments, an enzymatic DNA 
molecule of the present invention may be engineered to bind small molecules or ligands, 
5 such as adenosine triphosphate (ATP). (See, e.g., Sassanfar, et al.. Nature 364 : 550- 

553 (1993).) 

In another embodiment, the present invention contemplates that a population of 
enzymatic DNA molecules may be subjected to mutagenizing conditions to produce a 
diverse population of mutant enzymatic DNA molecules (which may alternatively be 

10 called "deoxyribozymes" or "DNAzymes"). Thereafter, enzymatic DNA molecules having 

desired characteristics are selected and/or separated from the population and are 
subsequently amplified. 

Alternatively, mutations may be introduced in the enzymatic DNA molecule by 
altering the length of the recognition domains of the enzymatic DNA molecule. The 

15 recognition domains of the enzymatic DNA molecule associate with a complementary 

sequence of bases within a substrate nucleic acid sequence. Methods of altering the 
length of the recognition domains are known in the art and include PCR, for example; 
useful techniques are described further in the Examples below. 

Alteration of the length of the recognition domains of an enzymatic DNA 

20 molecule may have a desirable effect on the binding specificity of the enzymatic DNA 

molecule. For example, an increase in the length of the recognition domains may 
increase binding specificity between the enzymatic DNA molecule and the 
complementary base sequences of an oligonucleotide in a substrate, or may enhance 
recognition of a particular sequence in a hybrid substrate. In addition, an increase in the 

25 length of the recognition domains may also increase the affinity with which it binds to 

substrate. In various embodiments, these altered recognition domains in the enzymatic 
DNA molecule confer increased binding specificity and affinity between the enzymatic 
DNA molecule and its substrate. 

It has recently been noted that certain oligonucleotides are-ahJe-to Eeeoginize^and 

30 bind molecules other than oligonucleotides with complementary sequences. These 

oligonucleotides are often given the name "aptamers". For example, Ellington and 
Szostak describe RNA molecules that are able to bind a variety of organic dyes (N^TWrg 
346 : 818-822 (1990)}, while Bock, et al. describe ssDNA molecules that bind human 
thrombin ( Nature 355 : 564-566 (1992)). Similarly, Jellinek, et al. describe RNA ligands 

35 to basic fibroblast growth factor ( PNAS USA 90 : 11227-11231 (1993)). Thus, it is 

further contemplated herein that the catalytically active DNA enzymes of the present 
invention may be engineered according to the within-described methods to display a 
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variety of capabilities typically associated with aptamers. 

One of skill in the art should thus appreciate that the enzymatic DNA molecules 
of this invention can be altered at any nucleotide sequence, such as the recognition 
domains, by various methods disclosed herein, including PCR and 3SR (self-sustained 
5 sequence replication - see Example 1 below). For example, additional nucleotides can 

be added to the 5' end of the enzymatic DNA molecule by including additional 

nucleotides in the primers. 

Enzymatic DNA molecules of the present invention may also be prepared or 
engineered in a more non-random fashion via use of methods such as site-directed 
1 0 mutagenesis. For example, site-directed mutagenesis may be carried out essentially as 

described in Morinaga, et al., Bfllafralfifly 2: 636 (1984). modified as described 
herein, for application to deoxyribozymes. -Useful methods of engineering enzymat,c 
DNA molecules are further described in the Examples below. 

In one disclosed embodiment, an enzymatic DNA molecule of the present 
1 5 invention comprises a conserved core flanked by two substrate binding lor recognition) 

domains or sequences that interact with the substrate through base-pairing interact.ons. 
in various embodiments, the conserved core comprises one or more conserved domains 
or sequences. In another variation, an enzymatic DNA molecule further comprises a 
-spacer' region lor sequence) between the regions (or sequences) involved in base 
30 pairing. In still another variation, the conserved core is "interrupted" at various intervals 

by one or more less-conserved variable or "spacer" nucleotides. 

In various embodiments, the population of enzymatic ONA molecules is made up 
of at least 2 different types of deoxyribozyme molecules. For example, in one variation, 
the molecules have differing sequences. In another variation, the deoxyribozymes are 
25 nucleic acid molecules having a nucleic acid sequence defining a recognition domain that 

is contiguous or adjacent to the S'-termlnus of the nucleotide sequence. In venous 
alternative embodiments, enzymatic DNA molecules of the present invention may further 
, comprise one or more spacer regions located T-termina. to the recognition domains, one 
or more loops located T-terminal to the recognition domains and/or spacer reg.ons. In 
30 other variations, a deoxyribozyme of the present invention may comprise one or more 

regions which are capable of hybridizing to other regions of the same molecule. Other 
characteristics of enzymatic DNA molecules produced according to the presently- 
disclosed methods are described elsewhere herein. 

In other embodiments, mutagenizing conditions include conditions that introduce 
35 either defined or rendom nucleotide substitutions within an enzymatic DNA molecule. 

Examples of typical mutagenizing conditions include conditions disclosed in other parts 
of this specification and the methods described by Joyce et al.. Nud AfflweRM . 17 .: 
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71 1-722 (1989); Joyce, Gene 82 : 83-87(1989); and Beaudry and Joyce, Science 257 : 
635-41 (1992). 

In still other embodiments, a diverse population of mutant enzymatic nucleic acid 
molecules of the present invention is one that contains at least 2 nucleic acid molecules 
5 that do not have the exact same nucleotide sequence. In other variations, from such a 

diverse population, an enzymatic DNA molecule or other enzymatic nucleic acid having a 
predetermined activity is then selected on the basis of its ability to perform the 
predetermined activity. In various embodiments, the predetermined activity comprises, 
without limitation, enhanced catalytic activity, decreased K M , enhanced substrate 

10 binding ability, altered substrate specificity, and the like. 

Other parameters which may be considered aspects of enzyme performance 
include catalytic activity or capacity, substrate binding ability, enzyme turnover rate, 
enzyme sensitivity to feedback mechanisms, and the like. In certain aspects, substrate 
specificity may be considered an aspect of enzyme performance, particularly in 

1 5 situations in- which an enzyme is able to recognize and bind two or more competing 

substrates, each of which affects the enzyme's performance with respect to the other 
substrate(s). 

Substrate specificity, as used herein, may refer to the specificity of an enzymatic 
nucleic acid molecule as described herein for a particular substrate, such as one 

20 comprising ribonucleotides only, deoxyribonucleotides only, or a composite of both. 

Substrate molecules may also contain nucleotide analogs. In various embodiments, an 
enzymatic nucieic acid molecule of the present invention may preferentially bind to a 
particular region of a hybrid or non-hybrid substrate. 

The term or parameter identified herein as "substrate specificity" may also 

25 include sequence specificity; i.e., an enzymatic nucleic acid molecule of the present 

invention may "recognize" and bind to a nucleic acid substrate having a particular 
nucleic acid sequence. For example, if the substrate recognition domains of an 
enzymatic nucleic acid molecule of the present invention will only bind to substrate 
molecules having a series of one or two ribonucleotides (e.g., rA) in a row, then the 

30 enzymatic nucleic acid molecule will tend not to recognize or bind nucleic acid substrate 

molecules lacking such a sequence. 

With regard to the selection process, in various embodiments, selecting includes 
any means of physically separating the mutant enzymatic nucleic acids having a 
predetermined activity from the diverse population of mutant enzymatic nucleic acids, 

35 Often, selecting comprises separation by size, by the presence of a catalytic activity, or 

by hybridizing the mutant nucleic acid to another nucieic acid, to a peptide, or some 
other molecule that is either in solution or attached to a solid matrix. 
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ln various embodiments, the predetermined activity is such that the mutant 
enzymatic nucleic acid having the predetermined activity becomes labeled in some 
fashion by virtue of the activity. For example, the predetermined activity may be an 
enzymatic DNA molecule activity whereby the activity of the mutant enzymatic nucleic 
5 acid upon its substrate causes the mutant enzymatic nucleic acid to become covalently 

linked to it. The mutant enzymatic nucleic acid is then selected by virtue of the 
covalent linkage. 

In other embodiments, selecting a mutant enzymatic nucleic acid having a 
predetermined activity includes amplification of the mutant enzymatic nucleic acid (see. 
10 eg., Joyce, n»ne 82 : 83-87 11989); Beaudry and Joyce, 2 57 .: 635-41 (1992)). 

Other methods of selecting an enzymatic nuc.eic acid molecule having a predetermined 
characteristic or activity are described in the Examples section. 
E. Cnrpnnsitions 

The invention also contemplates compositions containing one or more types or 
1 5 populations of enzymatic DNA molecules of the present invention; e.g.. different types 

or populations may recognize and cleave different nucleotide sequences. Compositions 
may further include a ribonucleic acid-containing substrate. Compositions accordmg to 
the present invention may further comprise .ead ion. magnesium ion. or other divalent or 
. monovalent cations, as discussed herein. 

Preferably, the enzymatic DNA molecule is present at a concentration of about 

0 05 „M to about 2 pM. Typical.y. the enzymatic DNA molecu.e is present at a 
concentration ratio of enzymatic DNA molecu.e to substrate of from about 1:5 to about 

1 -50 More preferably, the enzymatic DNA molecule is present in the compos.t.on at a 
concentration of about 0.1 pM to about 1 pM. Even more preferab.y, compositions 

25 contain the enzymatic DNA mo.ecule at a concentration of about 0.1 pM to about 0.5 

„M. Preferably, the substrate is present in the composition at a concentration of about 

0.5 pM to about 1000 fM. 

One skilled in the art will understand that there are many sources of nucleic 
acid-containing substrates induding-natur^occor^and .syntheticsoucces. Sources 
30 . of suitable substrates include, without limitation, a variety of viral and retroviral agents, 
including HIV-1, HIV-2, HTLV-I. and HTLV-II. 

Other suitable substrates include, without limitation, viral and retroviral agents 
including those comprising or produced by picornaviruses, hepadnaviridae (e.g.. HBV, 
HCV), papillomaviruses (e.g., HPV). gammaherpesvirinae (e.g.. EBV), 
35 lymphocryptoviruses. leukemia viruses (e.g.. HTLV-I and -ID. f.aviviruses. togaviruses, 

herpesviruses (including alphaherpesviruses and betaherpesviruses). cytomegalov.ruses 
(CMV) influenza viruses, and viruses and retroviruses contributing to immunodefcency 
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diseases and syndromes (e.g., HIV-1 and -2). In addition, suitable substrates include 
viral and retroviral agents which infect non-human primates and other animals including, 
without limitation, the simian and feline immunodeficiency viruses and bovine leukemia 
viruses. 

Magnesium ion, lead ion, or another suitable monovalent or divalent cation, as 
described previously, may also be present in the composition, at a concentration ranging 
from about 1-100 mM. More preferably, the preselected ion is present in the 
composition at a concentration of about 2 mM to about 50 mM, with a concentration of 
about 5 mM being particularly preferred. One skilled in the art will understand that the 
ion concentration is only constrained by the limits of solubility of its source (e.g. 
magnesium) in aqueous solution and a desire to have the enzymatic DNA molecule 
present in the same composition in an active conformation. 

The invention also contemplates compositions containing an enzymatic DNA 
molecule of the present invention, hybrid deoxyribonucleotide-ribonucieotide molecules, 
and magnesium or lead ion in concentrations as-describedhereieabxave.- As noted 
previously, other monovalent or divalent ions (e.g., Ca 2+ ) may be used in place of 
magnesium. 

Also contemplated by the present invention are compositions containing an 
enzymatic DNA molecule of the present invention, nucleic acid-containing substrate (e.g. 
RNA), and a preselected ion at a concentration of greater than about 1 miliimolar, 
wherein said substrate is greater in length than the recognition domains present on the 
enzymatic DNA molecule. 

In one variation, a composition comprises an enzymatic DNA molecule-substrate 
complex, wherein base pairing between an enzymatic DNA molecule and its substrate is 
contiguous. In another embodiment, base pairing between an enzymatic DNA molecule 
and its substrate is interrupted by one or more noncomplementary pairs. In a variety of 
alternative embodiments, a composition of the present invention may further comprise a 
monovalent cation, a divalent cation, or both. 

In another variation, an enzymatic DNA moJecuJe of the present invention is 
capable of functioning efficiently in the presence or absence of a divalent cation. In one 
variation, a divalent cation is present and comprises Pb 2 *, Mg 2 + , Mn 2 \ Zn 2 + , or Ca 2 + . 
Alternatively, an enzymatic DNA molecule of the present invention is capable of 
functioning efficiently in the presence or absence of monovalent cations. It is 
anticipated that monovalent or divalent cation concentrations similar to those described 
herein for Pb 2 * or Mg 2 * will be useful as disclosed herein. 

Optionally, monovalent cations may also be present in addition to, or as 
"alternatives" for, divalent cations. For example, monovalent cations such as sodium 
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(Na*) or potassium (K*) may be present, either as dissociated ions or in the form of 
dissociable compounds such as NaCJ or KCl. 

In one embodiment, the concentration of monovalent cation present in the 
composition ranges from 0-1.0 M. In another embodiment, a monovalent cation is 
5 present in a concentration ranging from about 0-200 mM. In other embodiments, 

monovalent cations are present in a concentration ranging from about 1-100 mM. 
Alternatively, the concentration of monovalent cations ranges from about 2 mM - 50 
mM. In still other embodiments, the concentration ranges from about 2 mM - 25 mM. 
F. Methods of Using Enzvmatic ONA Molecules 

10 The methods of using enzymatic DNA molecules as disclosed herein are legion. 

As discussed previously, molecules capable of cleaving the bonds linking neighboring 
nucleic acids (e.g., phosphoester bonds} have numerous uses encompassing a wide 
variety of applications. For example, enzymatic DNA molecules having the within- 
disclosed capabilities, structures, and/or functions are useful in pharmaceutical and 

15 medical products (e.g., for wound debridement, clot dissolution, etc.), as well as in 

household items |e,g., detergents, dental hygiene products, meat tenderizers). Industrial 
utility of the within-disclosed compounds, compositions and methods is also 
contemplated and well within the scope of the present invention. 

The present invention also describes useful methods for cleaving any single- 

20 stranded, looped, partially or fully double-stranded nucleic acid; the majority of these 

methods employ the novel enzymatically active nucleic acid molecules of the present 
invention. In various embodiments, the single-stranded nucleic acid segment or portion 
of the substrate (or the entire substrate Itself) comprises DNA, modified DNA, RNA, 
modified RNA, or composites thereof. Preferably, the nucleic acid substrate need only 

25 be single-stranded at or near the substrate cleavage sequence so that an enzymatic 

nucleic acid molecule of the present invention can hybridize to the substrate cleavage 
sequence by virtue of the enzyme's recognition sequence. 

A nucleic acid substrate that can be cleaved by a method of this invention may 
be chemically synthesized or enzymatically produced, or it may be isolated from various 

30 sources such as phages, viruses, prokaryotic cells, or eukaryatic cells, including animal 

cells, plant cells, yeast cells and bacterial cells. Chemically synthesized single- and 
double-stranded nucleic acids are commercially available from many sources including, 
without limitation, Research Genetics (Huntsville, AL)'. 

RNA substrates may also be synthesized using an Applied Biosystems (Foster 

35 City, CA) oligonucleotide synthesizer according to the manufacturer's instructions. 

Single-stranded phage are also a source of nucleic acid substrates. (See, e.g.. Messing 
et al., PNAS USA 74 : 3642-3646 (1977), and Yanisch-Perron et al., Gene 33: 103-119 
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(1985).) Bacterial cells containing single-stranded phage would also be a ready source 
of suitable single-stranded nucleic acid substrates. 

Single-stranded RNA cleavable by a method of the present invention could be 
provided by any of the RNA viruses such as the picornaviruses, togaviruses, 
5 orthomyxoviruses, paramyxoviruses, rhabdoviruses, coronaviruses, arenaviruses or 

retroviruses. As noted previously, a wide variety of prokaryotic and eukaryotic cells 
may also be excellent sources of suitable nucleic acid substrates. 

The methods of this invention may be used on single-stranded nucleic acids or 
single-stranded portions of looped or double-stranded nucleic acids that are present 

10 inside a cell, including eukaryotic, procaryotic, plant, animal, yeast or bacterial cells. 

Under these conditions an enzymatic nucleic acid molecule (e.g., an enzymatic DNA 
molecule or deoxyribozyme) of the present invention could act as an anti-viral agent or a 
regulator of gene expression. Examples of such uses of enzymatic DNA molecules of 
the present invention are described further hereinbelow. 

15 In the majority of methods of the present invention, cleavage o*f single-stranded 

nucleic acids occurs at the 3'-terminus of a predetermined base sequence. This 
predetermined base sequence or substrate cleavage sequence typically contains from 1 
to about 10 nucleotides. In other preferred embodiments, an enzymatic DNA molecule 
of the present invention is able to recognize nucleotides either upstream, or upstream 

20 and downstream of the cleavage site. In various embodiments, an enzymatic DNA 

molecule is able to recognize about 2-10 nucleotides upstream of the cleavage site; in 
other embodiments, an enzymatic DNA molecule is able to recognize about 2-10 
nucleotides upstream and about 2-10 nucleotides downstream of the cleavage site. 
Other preferred embodiments contemplate an enzymatic DNA molecule that is capable 

25 of recognizing a nucleotide sequence up to about 30 nucleotides in length, with a length 

up to about 20 nucleotides being even more preferred. 

The within-disclosed methods allow cleavage at any nucleotide sequence by 
altering the nucleotide sequence of the recognition domains of the enzymatic DNA 
molecule. This allows cleavage of single-stranded nucleic acid in the absence-of a 

30 restriction endonuclease site at the selected position. 

An enzymatic DNA molecule of the present invention may be separated from any 
portion of the single-stranded nucleic acid substrate that remains attached to the 
enzymatic DNA molecule by site-specific hydrolysis at the appropriate cleavage site. 
Separation of the enzymatic DNA molecule from the substrate (or "cleavage product") 

35 allows the enzymatic DNA molecule to carry out another cleavage reaction. 

Generally, the nucleic acid substrate is treated under appropriate nucleic acid 
cleaving conditions - preferably, physiologic conditions - with an effective amount of 



WO 96/17086 



PCTAJS95/15580 



-27- 

an enzymatic DNA molecule of the present invention. If the nucleic acid substrate 
comprises DNA, cleaving conditions may include the presence of a divalent cation at a 
concentration of about 2-1 OmM. 

An effective amount of an enzymatic DNA molecule is the amount required to 
5 cleave a predetermined base sequence present within the single-stranded nucleic acid. 

Preferably, the enzymatic DNA molecule is present at a molar ratio of DNA molecule to 
substrate cleavage sites of 1 to 20. This ratio may vary depending on the length of 
treating and efficiency of the particular enzymatic DNA molecule under the particular 
nucleic acid cleavage conditions employed. 

10 Thus, in one preferred embodiment, treating typically involves admixing, in 

aqueous solution, the RNA-containing substrate and the enzyme to form a cleavage 
admixture, and then maintaining the admixture thus formed under RNA cleaving 
conditions for a time period sufficient for the enzymatic DNA molecule to cleave the 
RNA substrate at any of the predetermined nucleotide sequences present in the RNA. In 

1 5 various embodiments, a source of ions is also provided - i.e. monovalent or divalent 

cations, or both. 

In one embodiment of the present invention, the amount of time necessary for 
the enzymatic DNA molecule to cleave the single-stranded nucleic acid has been 
predetermined. The amount of time is from about 1 minute to about 24 hours and will 

20 vary depending upon the concentration of the reactants and the temperature of the 

reaction. Usually, this time period is from about 10 minutes to about 2 hours such that 
the enzymatic DNA molecule cleaves the single-stranded nucleic acid at any of the 
predetermined nucleotide sequences present. 

The invention further contemplates that the nucleic acid cleaving conditions 

25 include the presence of a source of divalent cations (e.g., PbOAc) at a concentration of 

about 2-100 mM. Typically, the nucleic acid cleaving conditions include divalent cation 
at a concentration of about 2 mM to about 10 mM, with a concentration of about 5 mM 

being particularly preferred. 

The-opiirnalncationic concentration to include in the nucleic acid cleaving 
30 conditions can be easily determined by determining the amount of single-stranded 

nucleic acid cleaved at a given cation concentration. One skilled in the art will 
understand that the optimal concentration may vary depending on the particular 
enzymatic DNA molecule employed. 

The present invention further contemplates that the nucleic acid cleaving 
35 conditions include a pH of about pH 6.0 to about pH 9.0, In one preferred embodiment. 

the pH ranges from about pH 6.5 to pH 8.0. In another preferred embodiment, the pH 
emulates physiological conditions, i.e.. the pH is about 7.0-7.8. with a pH of about 7.5 
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being particularly preferred. 

One skilled in the en will appreciate that the methods of the present invention 
will work over a wide pH range so long as the pH used for nucleic acid cleaving is such 
that the enzymatic DNA molecule is able to remain in an active conformation. An 
5 enzymatic DNA molecule in an active conformation is easily detected by its ability to 

cleave single-stranded nucleic acid at a predetermined nucleotide sequence. 

In various embodiments, the nucleic acid cleaving conditions also include a 
variety of temperature ranges. As noted previously, temperature ranges consistent with 
physiological conditions are especially preferred, although temperature ranges consistent 

10 with industrial applications are also contemplated herein. In one embodiment, the 

temperature ranges from about 15"C to about 60*C. In another variation, the nucleic 
acid cleaving conditions include a temperature ranging from about 30°C to about 56°C. 
In yet another variation, nucleic acid cleavage conditions include a temperature from 
about 35 *C to about 50*0. In a preferred embodiment, nucleic acid cleavage conditions 

1 5 comprise a - temperature range of about 37*C to about 42*C. The temperature ranges 

consistent with nucleic acid cleaving conditions are constrained only by the desired 
cleavage rate and the stability of that particular enzymatic DNA molecule at that 
particular temperature. 

In various methods, the present invention contemplates nucleic acid cleaving 

20 conditions including the presence of a polyamine. Polyamines useful for practicing the 

present invention include spermidine, putrescine, spermine and the like. In one 
variation, the polyamine is present at a concentration of about .01 mM to about 10 mM. 
In another variation, the polyamine is present at a concentration of about 1 mM to about 
10 mM. Nucleic acid cleavage conditions may also include the presence of polyamine at 

25 a concentration of about 2 mM to about 5 mM. In various preferred embodiments, the 

polyamine is spermidine. 
G. Vectors 

The present invention also features expression vectors including a nucleic acid 
segment encoding an enzymatic DNA molecule of the present invention situated within 
30 the vector, preferably in a manner which allows expression of that enzymatic DNA 

molecule within a target ceil (e.g., a plant or animal cell). 

Thus, in general, a vector according to the present invention preferably includes 
a plasmid, cosmid, phagemid, virus, or phage vector. Preferably, suitable vectors 
comprise single-stranded DNA {ssDNA) - e.g., circular phagemid ssONA. It should also 
35 be appreciated that useful vectors according to the present invention need not be 

circular. 

In one variation, nucleotide sequences flanking each of the additional enzymatic 
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DNA molecule-encoding sequences are preferably provided, which sequences may be 
recognized by the first enzymatic DNA molecule. The intervening or flanking sequences 
preferably comprise at least 1 nucleotide; more preferably, intervening or flanking 
sequences are about 2-20 nucleotides in length, with sequences of about 5-10 
5 nucleotides in length being particularly preferred. 

The addition of polynucleotide tails may also be useful to protect the 3' end of 
an enzymatic DNA molecule according to the present invention. These may be provided 
by attaching a polymeric sequence by employing the enzyme terminal transferase. 

A vector according to the present invention includes two or more enzymatic 
10 DNA molecules. In one embodiment, a first enzymatic DNA molecule has intramolecular 

cleaving activity and is able to recognize and cleave nucleotide sequences to release 
other enzymatic DNA sequences; i.e., it is able to function to "release" other enzymatic 
DNA molecules from the vector. For example, a vector is preferably constructed so that 
when the first enzymatic DNA molecule is expressed, that first molecule is able to 
1 5 cleave nucleotide sequences flanking additional nucleotide sequences encoding a second 

enzymatic DNA molecule, a third enzymatic DNA molecule, and so forth. Presuming 
said first enzymatic DNA molecule (i.e., the "releasing" molecule) is able to cleave 
oligonucleotide sequences intramolecularly, the additional (e.g. second, third, and so on) 
enzymatic DNA molecules (i.e., the "released" molecules) need not possess 
20 characteristics identical to the "releasing" molecule. For example, in one embodiment. 

the "released" (i.e.. the second, third, etc.) enzymatic DNA molecules are able to cleave 
specific RNA sequences, while the first ("releasing") enzymatic DNA molecule has 
nuclease activity allowing it to liberate the "released" molecules. In another 
embodiment, the "released" enzymatic DNA molecule has amide bond-cleaving activity, 
25 while the first ("releasing") enzymatic DNA molecule has nuclease activity. 

Alternatively, the first enzymatic DNA molecule may be encoded on a separate 
vector from the second (and third, fourth, etc.) enzymatic DNA molecule(s) and may 
have intermodular cleaving activity. As noted herein, the first enzymatic DNA 
molecule can be a self-cleaving enzymatic DNA motawaWteg*^^ 
30 the second enzymatic DNA molecule may be any desired type of enzymatic DNA 

molecule. When a vector is caused to express DNA from these nucleic acid sequences, 
that DNA has the ability under appropriate conditions to cleave each of the flanking 
regions, thereby releasing one or more copies of the second enzymatic DNA molecule. 
If desired, several different second enzymatic DNA molecules can be placed in the same 
35 cell or carrier to produce different deoxyribozymes. It is also contemplated that any one 

or more vectors may comprise one or more ribozymes or deoxyribozymes in any 
combination of "releasing" and "released" enzymatic nucleic acid molecules, as long as 
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such a combination achieves the desired result: the release of enzymatic nucleic acid 
molecules that are capable of cleaving predetermined nucleic acid sequences. 

Methods of isolating and purifying enzymatic DNA molecules of the present 
invention are also contemplated. In addition to the methods described herein, various 
purification methods (e.g. those using HPLC) and chromatographic isolation techniques 
are available in the art. See. e.g., the methods described in published international 
application no. WO 93/23569, the disclosures of which are incorporated herein by 
reference. 

It should also be understood that various combinations of the embodiments 
described herein are included within the scope of the present invention. Other features 
and advantages of the present invention will be apparent from .the descriptions 
hereinabove, from the Examples to follow, and from the claims. 

EXAMPLES 

The following examples illustrate, but do not limit, the present invention. 

Example 1 

In Vitro Evolution of EnzvmR fic DNA Molfimlg*; 
AP Qvgrvigw 

In vitro selection and in vitro evolution techniques allow new catalysts to be 
isolated without a priori knowledge of their composition or structure. Such methods 
have been used to obtain RNA enzymes with novel catalytic properties. For example, 
ribozymes that undergo autolytic cleavage with lead cation have been derived from a 
randomized pool of tRNA"" molecules (Pan and Uhlenbeck, Biochemistry 31 ? 3887-3895 
(1992)). Group I ribozyme variants have been isolated that can cleave DNA {Beaudry 
and Joyce, Science 257: 635-641 (1992)) or that have altered metal dependence 
(Lehman and Joyce, Nature 351; 182-185 (1993)). Starting with a pool of random RNA 
sequences, molecules have been obtained that catalyze a polymerase-like reaction 
(Bartel and Szostak, Science 261: 1411-1418(1993)). In the present example, 
refinement of specific catalytic properties'of *anrevo1veri*enzyme via alteration of the 
selection constraints during an in vitro evolution procedure is described. 

Darwinian evolution requires the repeated operation of three processes: (a) 
introduction of genetic variation; (b) selection of individuals on the basis of some fitness 
criterion; and (c) amplification of the selected individuals. Each of these processes can 
be realized in vitro (Joyce, Gene B2: 83 (1989)). A gene can be mutagenized by 
chemical modification, incorporation of randomized mutagenic oligodeoxynucleotides, or 
inaccurate copying by a polymerase. (See, e.g., Cadweil and Joyce, in PCR Methods 
and Applications 2: 28-33 (1992); Cadwell and Joyce, PCR Methods and Annlications 3 
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ISuppl,) : S136-S140 (1994); Chu, et al., Virolocv 98: 168 (1979); Shortle, et al., MfillL. 
Pnrvmol. 10Q : 457 (1983); Myers, et al., fomnfifi 229: 242 (1985); Matteucci, et al., 
Uu cl flic Acids Res. 11 : 31 13 (1983); Wells, at al., Gsng 34: 315 (1985); McNeil, et al.. 
Mnl CpII. Biol. 5 : 3545 (1985); Hutchison, et al., PNAS USA 83: 710 (1986); 
5 Derbyshire, et al-. Gene 46 : 145 (1986); Zakour, et al., NflTUfS 2 95 : 708 (1982); 

Lehtovaara, et al EtfllfilD Eno. 2 : 63 (1988); Leung, et al.. IftfihfllflUfl 1 : 11 11989); 

7hTH, " °' N1I17I. Res 19: 6052 <199P.) 

The gene product can be selected, for example, by its ability to bind a ligand or 
to carry out a chemical reaction. (See, e.g., Joyce, \sL (1989); Robertson and Joyce, 

10 Nature 344 : 467 (1990); Tuerk, et al., Sconce 249: 505 (1990).) The gene that 

corresponds to the selected gene product can be amplified by a reciprocal primer 
method, such as the polymerase chain reaction (PCR). (See, e.g., Saiki, et al., Science 
23fl: 1350-54 (1 985); Saiki, et al., SfiiflDSg 239: 487-491 (1988).) 

Alternatively, nucleic acid amplification may be carried out using self-sustained 

1 5 sequence replication (3SR). (See, e.g., Guatelli, et al., PNAS USA 87 : 1874 (1990), the 

disclosures of which are incorporated by reference herein.) According to the 3SR 
method, target nucleic acid sequences may be amplified (replicated) exponentially in 
vitro under isothermal conditions by using three enzymatic activities essential to 
retroviral replication: (1) reverse transcriptase, (2) RNase H, and (3) a DNA-dependent 

20 RNA polymerase. By mimicking the retroviral strategy of RNA replication by means of 

cDNA intermediates, this reaction accumulates cDNA and RNA copies of the original 
target. 

In summary, if one is contemplating the evolution of a population of enzymatic 
DNA molecules, a continuous series of reverse transcription and transcription reactions 

25 replicates an RNA target sequence by means of cDNA intermediates. The crucial 

elements of this design are (a) the oligonucleotide primers both specify the target and 
contain 5' extensions encoding the T7 RNA polymerase binding site, so that the 
resultant cDNAs are competent transcription templates; (b) cDNA synthesis can proceed 
to completion of both strands due to the degradation of template RNA In the 

30 intermediate RNA-DNA hybrid by RNase H; and (c) the reaction products (cDNA and 

RNA) can function as templates for subsequent steps, enabling exponential replication. 

If one is evolving enzymatic ONA molecules, various critical elements of this 
design are somewhat different, as disclosed in these Examples. For instance, (1) the 
oligonucleotide primers specify the target and are preferably "marked' or labeled in 

35 some fashion - e.g.. via biotinylation - so the resultant competent template strands are 

easily identified; and (2) the in vitro selection procedure used preferably depends upon 
the identification of the most favorable release mechanism. 
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A major obstacle to realizing Darwinian evolution in vitro is the need to integrate 
mutation and amplification, both of which are genotype-related, with selection, which is 
phenotype-related. In the case of nucleic acid enzymes, for which genotype and 
phenotype are embodied in the same molecule, the task is simplified. 
5 A. Design of Enzvmatic DNA MnlenulBs 

It is well known that single-stranded DNA can assume interesting tertiary 
structures. The structure of a "tDNA", for example, closely resembles that of the 
corresponding tRNA. (See Paquette, et at., Eur. J. Biochem. 1RA : 259-265 (1990).) 
Furthermore, it has been possible to replace as many as 31 of 35 ribonucleotides within 

10 a hammerhead ribozyme, while retaining at least some catalytic activity. (See Perreault, 

et al., Nature 344: 565-567 (1990); Williams, et al., Proc. Natl. Acad. Sri. USA ffQ - 
918*921 (1992); Yang, et al., Biochemistry 31 : 5005-5009 (1992).) 

In vitro selection techniques have been applied to large populations of 
random-sequence DNAs, leading to the recovery of specific DNA "aptamers" that bind a 

1 5 target ligand with high affinity (Bock, et al., Nature 355 : 564-566 (1 992); Ellington & 

Szostak, Nature 355: 850-852 (1992); Wyatt & Ecker, PNAS USA 91 : 1356-1360 
(1 994)). Recently, two groups carried out the first NMR structural determination of an 
aptamer, a 15mer DNA that forms a G-quartet structure and binds the protein thrombin 
with high affinity (Wang, et al.. Biochemistry 32 : 1899-1904 (1993); Macaya, et al., 

20 PNAS USA 9,0: 3745-3749 (1993)). These findings were corroborated by an X-ray 

crystallographic analysis (Padmanabhan, et al,, J. Biol. Chem. 268 : 17651-17654 
(1993)). 

The ability to bind a substrate molecule with high affinity and specificity is a 
prerequisite of a goad enzyme. In addition, an enzyme must make use of 

25 well-positioned functional groups, either within itself or a cofactor, to promote a 

particular chemical transformation. Furthermore, the enzyme must remain unchanged 
over the course of the reaction and be capable of operating with catalytic turnover. 
Some would add the requirement that it be an informational macromolecule, comprised 
of subunits whose specific ordering is responsible for catalytic activity. While- these 

30 criteria are open to debate on both semantic and chemical grounds, they serve to 

distinguish phenomena of chemical rate enhancement that range from simple solvent 
effects to biological enzymes operating at the limit of substrate diffusion (Albery & 
Knowtes, Biochemistry 15: 5631-5640 (1976)). 

As described in greater detail hereinbelow, we sought to develop a general 

35 method for rapidly obtaining DNA catalysts and DNA enzymes, starting from random 

sequences. As an initial target, we chose a reaction that we felt was well within the 
capability of DNA: the hydrolytic cleavage of an RNA phosphodiester, assisted by a 
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divalent metal cofactor. This is the same reaction that is carried out by a variety of 
naturally-occurring RNA enzymes, including the hammerhead and hairpin motifs. (See, 
e.g., Forster A.C. & Symons R.H., Cell 49 : 211-220 (1987); Uhlenbeck, Nature 328: 
596-600 (1987); Hampel & Tritz, BiochemiSUV 28: 4929-4933 (1989)1. 
5 It has recently been shown that, beginning with a randomized library of tRNA 

molecules, one can obtain ribozymes that have Pb 2+ -dependent, site-specific RNA 
phosphoesterase activity at neutral pH (Pan & Uhlenbeck, Biochemistry 31: 3887-3895 
(1992); Pan & Uhlenbeck, Nature 358 : 560-563 (1992)). This is analogous to the 
fortuitous self-cleavage reaction of yeast tRNA**" (Dirheimer & Werner, BlQChimie 54 : 

10 127-144 (1972)}, which depends on specific coordination of a Pb 2 * ion at a defined site 

within the tRNA. (See Rubin & Sundaralingam, J, Piomc-f. Struct, PYn. 1: 639-646 
(1983); Brown, et al., Rinrhflmistrv 24: 4785-4801 (1985).) 

As disclosed herein, our goals included the development of DNAs that could 
carry out Pb 2 *-dependent cleavage of a particular RNA phosphoester, initially presented 

1 5 within a short leader sequence attached to the 5" end of the DNA, and ultimately 

located within a separate molecule that- could be-rieaved in an intermodular fashion 
with rapid catalytic turnover. These goals were successfully achieved, as described 
further below. 

No assumptions were made as to how the DNA would interact with the target 

20 phosphoester and surrounding nucleotides. Beginning with a pool of approximately .1 0 14 

random 50mer sequences, in vitro selection was allowed to run its course. After five 
rounds of selection carried out over four days, the population as a whole had attained 
the ability to cleave the target phosphoester in the presence of 1 mM Pb 2 * at a rate of 
about 0.2 min \ This is an approximately 10 5 -fold increase compared to the 

25 spontaneous rate of cleavage under the same reaction conditions. 

Individuals were isolated from the population, sequenced, and assayed for 
catalytic activity. Based on this information, the reaction was converted to an 
intermodular format and then simplified to allow site-specific cleavage of a 19mer 
substrate by a- 38mer DNA enzyme, in a reaction that proceeds with a turnover rate of 1 

30 mm' at 23°C and pH 7.0 in the presence of 1 mM PbOAc. 

B. (n vitm Selection Scheme 

A starting pool of approximately 10 1 * single-stranded DNA molecules was 
generated, all of which contain a 5' biotin moiety, followed successively by a fixed 
domain that includes a single ribonucleotide, a potential catalytic domain comprised of 

35 50 random deoxyribonucleotides, and a second fixed domain that lay at the 3' terminus 

(Fig. D. 

The pool was constructed by a nested PCR (polymerase chain reaction) 
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technique, beginning with synthetic DNA that contained 50 random nucleotides flanked 
by primer binding sites. The nested PCR primer was a S'-biotinylated synthetic 
oligodeoxynucleotide with a 3'-terminal adenosine ribonucleotide. 

Ribonucleotide-terminated oligonucleotides efficiently prime template-directed elongation 
in the context of the PCR (L.E. Orgel, personal communication), in this case giving rise 
to an extension product that contains a single embedded ribonucleotide. 

Figure 1 illustrates a selective amplification scheme for isolation of DNAs that 
cleave a target RNA phosphoester. Double-stranded DNA containing a stretch of 50 
random nucleotides is amplified via PCR, employing a 5'-biotinylated DNA primer {e.g., 
primer 3 « 3a or 3b) terminated at the 3' end by an adenosine ribonucleotide 
(represented by the symbol "N" or "rA\ wherein both N and rA represent an adenosine 
ribonucleotide). This primer is extended by Taq polymerase to yield a DNA product that 
contains a single embedded ribonucleotide. The resulting double-stranded DNA is 
immobilized on a streptavidin matrix and the unbiotinylated DNA strand is removed by 
washing with 0.2 N NaOH. After re-equilibrating the column with a buffered solution, 
the column is-wiJsheiaVwrth the same solution with added 1 mM PbOAc. DNAs that 
undergo Pb 2 * -dependent self-cleavage are released from the column, collected in the 
eluant, and amplified by PCR. The PCR products are then used to initiate the next round 
of selective amplification. 

The PCR products were passed over a streptavidin affinity matrix, resulting in 
noncovalent attachment of the S'-biotinylated strand of the duplex DNA. The 
nonbiotinylated strand was removed by brief washing with 0.2 N NaOH, and the bound 
strand was equilibrated in a buffer containing 0.5 M NaCI, 0.5 M KCI, 50 mM MgCI 3 , 
and 50 mM HEPES (pH 7.0) at 23*C. Next, 1 mM PbOAc was provided in the same 
buffer, allowing Pb 2 *-dependent cleavage to occur at the target phosphoester, thereby 
releasing a subset of the DNAs from the streptavidin matrix. In principle, an individual 
DNA might facilitate its own release by various means, such as disruption of-the 
interaction between biotin and streptavidin or cleavage of one of the 
deTSxyritrorracleottde linkages. It was felt that cleavage of the ribonucieoside 3 '-0-P 
bond would be the most likely mechanism for release, based on the relative lability of 
this linkage, and that Pb 2+ -dependent hydrolytic cleavage would allow release to occur 
most rapidly. In principle, however, the in vitro selection procedure should identify the 
most favorable release mechanism as well as those individuals best able to carry out 
that mechanism. 

DNA molecules released from the matrix upon addition of Pb 2 * were collected in 
the eluant, concentrated by precipitation with ethanol, and subjected to nested PCR 
amplification. As in the construction of the starting pool of molecules, the first PCR 
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amplification utilized primers that flank the random region (primers 1 and 2) and the 
second utilized a 5*-biotinylated primer (primer 3b) that has a 3 '-terminal riboadenylate, 
thereby reintroducing the target RNA phosphoester. The entire selective amplification 
procedure requires 3-4 hours to perform. 
5 The molecules are purified in three ways during each round of this procedure: 

first, following PCR amplification, by extracting twice with phenol and once with 
chloroform / isoamyl alcohol, then precipitating with ethanol; second, following 
attachment of the DNA to streptavidin, by washing away all the nonbiotinylated 
molecules under strongly denaturing conditions; and third, following elution with Pb 2 *, 
1 0 by precipitating with ethanol. There is no gel electrophoresis purification step, and thus 

no selection pressure constraining the molecules to a particular length. 

C. Sfi 'ff rTinn of ^talvtic PNA 

We carried out five successive rounds of in vitro selection, progressively 
decreasing the reaction time following addition of Pb 2 * in order to progressively increase 

1 5 the stringency of selection. During rounds 1 though 3, the reaction time was 1 hour; 

during round 4, the reaction time was 20 minutes; and during round 5, it was 1 minute. 
The starting pool of single-stranded DNAs, together with the population of molecules 
obtained after each round of selection, was assayed for self-cleavage activity under 
conditions identical to those employed during in vitro selection (see Fig. 2). 

20 For this assay, the molecules were prepared with a 5'-"P rather than a 5'-biotin 

moiety, allowing detection of both the starting material and the 5' cleavage product. 
Following a 5-minute incubation, there was no detectable activity in the initial pool (GO) 
or in the population obtained after the first and second rounds of selection. DNAs 
obtained after the third round (G3) exhibited a modest level of activity; this activity 

25 increased steadily, reaching approximately 50% self-cleavage for the DNAs obtained 

after the fifth round of selection (G5). Cleavage was detected only at the target 
phosphoester, even after long incubation times. This activity was lost if Pb^ was 
omitted from the reaction mixture. 

Figure 2 illustrates the self-cleavage activity of the starttTrg- p^ofr DN*T <G0) 

30 and populations obtained after the first through fifth rounds of selection (G1 - G5). 

Reaction mixtures contained 50 mM MgCI 2 , 0.5 M NaCI. 0.5 M KCJ, 50 mM HEPES (pH 
7.0 at 23-C). and 3 nM [5'- M P]-labeled DNA, incubated at 23'C for 5 min either in the 
presence or inthe absence of 1 mM PbOAc. The symbol Pre represents 108-nucleotide 
precursor DNA (SEQ ID NO 41; Civ. 28-nucleotide 5'-cleavage product (SEQ ID NO 5); 

35 and M, primer 3a (SEQ ID NO 61, corresponding in length to the 5'-cleavage product. 

The 28-nucleotide 5* cleavage product (Civ) illustrated preferably has the 
sequence 5'-GGGACGAATTCTAATACGACTCACTATN-3\ wherein "N" represents 
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adenosine ribonucleotide with an additional 2\ 3*-CYclic phosphate on the 3' end (SEQ 
ID NO 5). In alternative embodiments, "N" represents adenosine ribonucleotide with an 
additional 2' or 3' phosphate on the 3' end of the molecule. 

In Figure 2, the "GO" lane "Pre" band comprises a sampling of 1 08-nucleotide 
5 precursor DNAs that each include 50 random nucleotides. Therefore, any given "Pre" 

sampling will contain a wide variety of precursor DNAs, and each sampling will likely 
differ from previous and subsequent samplings. The "GV through "G5" lanes contain 
"Pre" bands that are increasingly enriched for catalytic DNA molecules, but still contain 
a large number of different DNA sequences (i.e.. differing in the 50 nucleotide 
10 randomized domain). A sample of these different sequences from "G5 Pre" DNA is 

provided in Figure 3. 

Shotgun cloning techniques were employed to isolate individuals from the G5 
population; the complete nucleotide sequences of 20 of these subclones were then 
determined (see Fig. 3). (Also see, e.g., Cadwell and Joyce, in PCR-Methods-and 
1 5 Applications 2 : 28-33 (1992); Cadwell and Joyce. PCR Methods and Applications 3 

(SupdIJ: S136-S140 (1994).) Of the 20 sequences, five were unique, two occurred 
twice, one occurred three times, and one occurred eight times. All of the individual 
variants share common sequence elements within the 50-nucleotide region that had 
been randomized in the starting pool of DNA. They all contain two presumed, template 
20 regions, one with complementarity to a stretch of nucleotides that lies just upstream 

from the cleavage site and the other with complementarity to nucleotides that lie at 
least four nucleotides downstream. Between these two presumed template regions lies 
a variable domain of 1-11 nucleotides, followed by the fixed sequence 5'-AGCG-3\ then 
a second variable domain of 3-8 nucleotides, and finally the fixed sequence 5'-CG-3' or 
25 5'-CGA-3\ Nucleotides that lie outside of the two presumed template regions are highly 

variable in both sequence and length. In all of the sequenced subclones, the region 
corresponding to the 50 initially-randomized nucleotides remains a total of 50 
nucleotides in length. 

Figure 3 illustrates"the=sequence:alignmeBt-of individual variants isolated from 
30 the population after five rounds of selection. The fixed substrate domain (5'- 

GGGACGAATTCTAATACGACTCAGTATrAGGAAGAGATGGCGAC-3', or 5'- 
GGGACGAATTCTAATACGACTCACTATNGGAAGAGATGGCGAC-3', where N represents 
adenosine ribonucleotide) (SEQ ID NO 13) is shown at the top, with the target 
riboadenylate identified with an inverted triangle. Substrate nucleotides that are 
35 commonly involved in presumed base-pairing interactions are indicated by a vertical bar. 

Sequences corresponding to the 50 initially-randomized nucleotides are aligned 
antiparaliel to the substrate domain. All of the variants are 3'-terminated by the fixed 
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sequence 5'-CGGTAAGCTTGGCAC-3' !SEQ ID NO 1) ("primer site"; not shown). 
Nucleotides within the initially-randomized region that are presumed to form base pairs 
with the substrate domain are indicated on the right and left sides of the Figure; the 
putative base-pair-forming (or substrate binding) regions of the enzymatic DNA 
5 molecules are individually boxed in each sequence shown. The highly-conserved 

nucleotides within the putative catalytic domain are illustrated in the two boxed 
columns. 

While it is anticipated that additional data will be helpful in constructing a 
meaningful secondary structural model of the catalytic domain, we note that, like the 
1 0 hammerhead and hairpin ribozymes. the catalytic domain of our enzymatic DNA 

molecules appears to contain a conserved core flanked by two substrate binding regions 
(or recognition domains) that interact with the substrate through base-pairing 
interactions. Similar to the hammerhead and hairpin ribozymes. the catalytic DNAs also 
appear to require a short stretch of unpaired substrate nucleotides - in this case 
1 5 5'-GGA-3' -- between the two regions that are involved in base pairing. 

It was also interesting to note that each of the nine distinct variants exhibited a 
different pattern of presumed complementarity with the substrate domain. In some 
cases, base pairing was contiguous, while in others it was interrupted by one or more 
noncomplementary pairs. The general tendency seems to be to form tighter interaction 
with the nucleotides that lie upstream from the cleavage site compared to those that l.e 
downstream. Binding studies and site-directed mutagenesis ana.ysis should enable us to 
gain further insights and to further substantiate this conjecture. 

In order to gain further insight into the sequence requirements for catalytic 
function, the self-cleavage activity of six of the nine variants was tested and evaluated 
25 under the within-described selection conditions (see Fig. 3). Not surprisingly, the 

sequence that occurred in eight of the 20 subclones proved to be the most reactive, 
with a first-order rate constant of 1 .4 min '. All of the studied variants were active in 
the self-cleavage assay and all gave rise to a single B'-labeled product correspond to 
cleavage at the target RNA phosphoester. 

The dominant subclone was further analyzed under a variety of reaction 
conditions. Its self-cleavage activity was dependent on Pb" but was unaffected if 
Mg" was omitted from the reaction mixture. There was a requirement for a 
monovalent cation as well, which can be met by either N.* or K*. The reaction rate 
increased linearly with increasing concentration of monovalent cation over the range of 
35 0 - 1 0 M [r = 0.998). Other variables that may affect the reaction, such as pH, 

temperature, and the presence of other divalent metals, are in the process of being 
evaluated further. 
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Example 2 

Materials and Mgihgds 

A - Oligonucleotides and OlioonifclPQtide Analogs 

Synthetic DNAs and DNA analogs were purchased from Operon Technologies. 
The 19-nucleotide substrate, 5'-pTCACTATrAGGAAGAGATGG-3' (or 5'- 
pTCACTATNGGAAGAGATGG-3', wherein "N" represents adenosine ribonucleotide) 
(SEQ ID NO 7), was prepared by reverse-transcriptase catalyzed extension of 
5'-pTCACTATrA-3' (or 5*-pTCACTATN-3\ wherein "NT represents adenosine 
ribonucleotide) (SEQ ID NO 8), as previously described (Breaker, Banerji; & Joyce, 
Biochemistry 33: 11980-11986 (1994)), using the template 
5'-CCATCTCTTCCTATAGTGAGTCCGGCTGCA-3' (SEQ ID NO 9), Primer 3, 5'- 
GGGACGAATTCTAATACGACTCACTATrA-3* (or 5'- 

GGGACGAATTCTAATACGACTCACTATN-3', wherein "N" represents adenosine 
ribonucleotide) (SEQ ID NO 6), was either 5'-labeled with {v- 32 P]ATP and T4 
polynucleotide kinase (primer 3a) or 5*-thiophosphorylated with (y-SJATP and T4 
polynucleotide kinase and subsequently biotinylated with /V-iod o acetyl- AT - 
biotinylhexylenediamine (primer 3b). 
B. PNA PqqI Preparation. 

The starting pool of DNA was prepared by PGR using the synthetic oligomer 
S'-GTGCCAAGCTTACCG-Nso-GTCGCCATCTCTTCC-S' {SEQ ID NO 4), where N is an 
equimolar mixture of G, A. T and C. A 2-ml PCR, containing 500 pmoles of the 
randomized oligomer, 1,000 pmoles primer 1 (S'-GTGCCAAGCTTACCG-S', SEQ ID NO 
10), 500 pmoles primer 2 

(5*-CTGCAGAATTCTAATACGACTCACTATAGGAAGAGATGGCGAC-3', SEQ ID NO 1 1), 
500 pmoles primer 3b, 10 ^Ci (a- 32 P)dATP, and 0.2 U ix\ y Taq DNA polymerase, was 
incubated in the presence of 50 mM KCI, 1 .5 mM MgCI 2 , 10 mM Tris-HCI (pH 8.3 at 
23'C), 0.01% gelatin, and 0.2 mM of each dNTP for 1 min at 92 # C, 1 min at 50'C, and 
2 min at 72°C, then 5 cycles of 1 min at 92*C, 1 min at 50 # C, and 1 min at 72"C. The 
resulting mixture was extracted twice with phenol and once with chloroform Aisoamyl 
alcohol, and the DNA was isolated by precipitation with ethanol. 

c. in Vitry Selection 

The starting pool of DNA was resuspended in 500 /iL of buffer A (1 M NaCI and 
50 mM HEPES (pH 7.0 at 23*Q) and was passed repeatedly over a streptavidin column 
(AffiniTip Strep 20, Genosys, The Woodlands, TX). The column was washed with five 
100-jil volumes of buffer A, followed by five 100-^1 volumes of 0.2 N NaOH, then 
equilibrated with five 100-^1 volumes of buffer B (0.5 M NaCI, 0.5 M KCI, 50 mM 
MgCI 2 , and 50 mM HEPES (pH 7.0 at 23*C)). The immobilized single-stranded DNA was 
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eluted over the course of 1 hr with three 20-^1 volumes of buffer B with added 1 mM 
PbOAc. The entire immobilization and elution process was conducted at 23'C. The 
eluant was collected in an equal volume of buffer C (50 mM HEPES {pH 7.0 at 23 °C) 
and 80 mM EDTA) and the DNA was precipitated with ethanoi. 
5 The resulting DNA was amplified in a 100-^uL PCR containing 20 pmoles primer 

1, 20 pmoles primer 2, 0.05 U /J' 1 Taq polymerase, 50 mM KCl, 1.5 mM MgCI 2 , 10 mM 
Tris-HCI (pH 8.3 at 23"C), 0.01 % gelatin, and 0.2 mM of each dNTP for 30 cycles of 
10 sec at 92*C, 30 sec at 50 a C, and 30 sec at 72*C. The reaction products were 
extracted twice with phenol and once with chloroform / isoamyl alcohol, and the DNA 

1 0 was recovered by precipitation with ethanoi. Approximately 4 pmoles of the amplified 

;• DNA was added to a second, nested PCR containing 100 pmoles primer 1, 100 pmoles 
primer 3b, 20 ^Ci [a- 32 P]dATP, and 0.1 U ^ Taq polymerase, in a total volume of 200 
jzL that was amplified for 10 cycles of 1 min at 92*C, 1 min at 50*C, and 1 min at 
72'C. The PCR products were once more extracted and precipitated, and the resulting 

1 5 DNA was resuspended in 50 fuL buffer A, then used to begin the next round of 

selection. 

The second and third rounds were carried out as above, except that the nested 
PCR at the end of the third round was performed in a 100-^1 volume. During the fourth 
round, the elution time following addition of Pb 2 " was reduced to 20 min {two 20-^L 

20 elution volumes) and only half of the recovered DNA was used in the first PCR, which 

involved only 1 5 temjperature cycles. During the fifth round, the elution time was 
reduced to 1 min (two 20-^L elution volumes) and only one-fourth of the recovered DNA 
was used in the first PCR, which involved 15 temperature cycles. DNA obtained after 
the fifth round of selection was subcloned and sequenced, as described previously 

25 (Tsang &. Joyce, Biochemistry 33 : 5966-5973 (1994)). 

D. fginetir Analyst nf Catalytic DNAs 

Populations of DNA and various subcloned individuals were prepared with a 
5'-"P label by asymmetric PCR in a 25-^1 reaction mixture containing 10 pmoles primer 
3a, 0.5-pmoles-mput DNA-, andO.1 U u\' x 7"<?<r polymerase, under conditions as described 

30 above, for 10 cycles of 1 min at 92°C, 1 min at 50°C, and 1 min at 72°C. The 

resulting [5'-"PHabeled amplification products were purified by electrophoresis in a 
10% polyacrylamide / 8 M gel. 

Self-cleavage assays were carried out following preincubation of the DNA in 
buffer B for 10 min. Reactions were initiated by addition of PbOAc to 1 mM final 

35 concentration and were terminated by addition of an equal volume of buffer C. Reaction 

products were separated by electrophoresis in e 10% polyacrylamide / 8 M gel. Kinetic 
assays under multiple-turnover conditions were carried out in buffer B that included 50 
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^tg ml' 1 BSA to prevent adherence of material to the vessel walls. Substrate and enzyme 
molecules were preincubated separately for 5 min in reaction buffer that lacked Pb 2 \ 
then combined, and the reaction was initiated by addition of PbOAc to a final 
concentration of 1 mM. 
5 Example 3 

Evolution of Deoxvribozvmes 

That C'egvs lntermQlgcyjariy 
a. Conversion to an Intermolscuigr Format 

Based on the variable pattern of presumed base-pairing interactions between the 

1 0 catalytic and substrate domains of the studied variants, it was felt that it would be 

reasonably straightforward to convert the DNA-catalyzed reaction to an intermoiecular 
format. In doing so, we wished to simplify the two substrate-binding regions of the 
catalyst so that each would form an uninterrupted stretch of 7-8 base pairs with the 
substrate. In addition, we wished to provide a minimal substrate, limited to the two 

1 5 baserpairing regions and: the intervening sequence 5*-GGA-3' (Fig. 4A). 

Figures 4A and 4B illustrate DNA-catalyzed cleavage of an RNA phosphoester in 
an intermoiecular reaction that proceeds with catalytic turnover. Figure 4A is a 
diagrammatic representation of the complex formed between the 19mer substrate and 
38mer ONA enzyme. The substrate contains a single adenosine ribonucleotide ("rA" or 

20 "N\ adjacent to the arrow), flanked by deoxyribonucleotides. The synthetic DNA 

enzyme is a 38-nucleotide portion of the most frequently occurring variant shown in Fig. 
3. Highly-conserved nucleotides located within the putative catalytic domain are 
"boxed". As illustrated, one conserved sequence is "AGCG", while another is "CG" 
(reading in the 5'-3' direction). 

25 Figure 4B shows an Eadie-Hofstee plot used to determine K m (negative slope) 

and (y-intercept) for DNA-catalyzed cleavage, of [5'- a2 P]-Iabeled substrate under 
conditions identical to those employed during in vitro selection. Initial rates of cleavage 
were determined for reactions involving 5 nM DNA enzyme and either 0.125, 0.5, 1, 2, 
or 4 /iM substrate. 

30 In designing the catalytic domain, we relied heavily on the composition of the 

most reactive variant, truncating by two nucleotides at the 5' end and 1 1 nucleotides at 
the 3' end. The 15 nucleotides that lay between the two template regions were left 
unchanged and a single nucleotide was inserted into the 3' template region to form a 
continuous stretch of nucleotides capable of forming base pairs with the substrate. The 

35 substrate was simplified to the sequence 5'- TCACTATrA • GGA AGAGATG G-3' (or 

5'- TCACTATN • GGA AGAGATG G-3', wherein "N" represents adenosine ribonucleotide) 
(SEQ ID NO 12), where the underlined nucleotides correspond to the two regions 
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involved in base pairing with the catalytic DNA molecule. 

The simplified reaction system, employing a 38mer catalytic DNA molecule 
(catalyst) comprised entirely of deoxyribonucleotides and a 1 9mer substrate containing a 
single ribonucleotide embedded within an otherwise all-ONA sequence, allows efficient 
5 DNA-catalyzed phosphoester cleavage with rapid turnover. Over a 90-minute incubation 

in the presence of 0.01 nM catalyst and 1 nM substrate, 46% ol the substrata is 
cleaved, corresponding to 46 turnovers of the catalyst. A preliminary Kinetic analysis of 
this reaction was carried out, evaluated under multiple-turnover conditions. The DNA 
catalyst exhibits Michaelis-Menten kinetics, with values for k„, and K m of 1 min' and 2 
10 fiM. respectively (see Fig. 4B). The value for K m is considerably greater than the 

expected dissociation constant between catalyst and substrate based on Watson-Crick 
interactions. The substrate was incubated under identical reaction conditions (but in the 
absence of the catalyst); a value for U of 4 ■ 10* min ' was obtained. This is 
consistent with the reported value Of 5 x 10 J min- for hydrolysis of the more labile 
15 1-nitrophenyl-1,2-propanediol in the presence of 0.5 mM Pb J * at pH 7.0 and 37'C 

(Breslow & Huang, P±JA£iiSA_aa: 4080-4083 (1991)1. 

It is now presumed that the phosphoester cleavage reaction proceeds via a 
hydrolytic mechanism involving attack by the ribonucleoside 2 '-hydroxyl on the vicinal 
phosphate, generating a 5' product with a terminal 2'(3')-cyclic phosphate and 3' 
20 product with a terminal 5'-hydroxyl. In support of this mechanism, the 3'-cleavage 

product is efficiently phosphorylated with T4 polynucleotide kinase and (y-"P]ATP, 
consistent with the availability of a free 5'-hydroxyl (data not shown). 
B. Discussion 

After five rounds of in vitro selection, a population of single-stranded DNA 
25 molecules that catalyze efficient Ptf--d-p.nd.nt cl.avag. of a target RNA phosphoester 

was obtained. Based on the common features of representative individuals isolated 
from this population, a simplified version of both the catalytic and substrate domains 
was constructed, leading to a demonstration of rapid catalytic turnover in an ■ 
intermodular context. Thus the 38mer catalytic domain provides-an^exem^of a.DNA 
30 enzyme, or what might be termed a 'deoxyribozyme*. 

Referring to this molecule as an enzyme, based on the (act that it is an 
informational macromolecule capable of accelerating a chemical transformation in a ■ 
reaction that proceeds with rapid turnover and obeys Michaelis-Menten kinetics, may 
not satisfy everyone's notion of what constitutes an enzyme. Some might insist that an 
35 enzyme, by definition, must be a polypeptide. If. however, one accepts the notion of an 

RNA enzyme, then it seems reasonable to adopt a similar view concerning DNA 
enzymes. Considering how quickly we were able to generate this molecule from a pool 
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of random-sequence DNAs, we expect that many other examples of synthetic DNA 
enzymes will appear in the near future. 

The Pb 2 + -dependent cleavage of an RNA phosphoester was chosen as an initial 
target for DNA catalysis because it is a straightforward reaction that simply requires the 
5 proper positioning of a coordinated Pb 2 *-hydroxyl to facilitate deprotonation of the 2 ' 

hydroxy! that lies adjacent to the cleavage site. {See, e.g., Pan, et al., in The RNA 
World . Gesteland & Atkins (eds.), pp. 271-302, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY (1993).) Pb 2 * is known to coordinate to the N7 position of 
purines, the 06 position of guanine, the 04 position of uracil, and the N3 position of 

10 cytosine (Brown, et al., Nature 303 : 543-546 (1993)). Thus, the differences in sugar 

. composition and conformation of DNA compared to RNA seemed unlikely to prevent 
DNA from forming a well-defined Pb 2 * -binding pocket. 

A substrate that contains a single ribonucleotide within an otherwise all-DNA 
sequence was chosen because it provided a uniquely favored site for cleavage and 

1 5 insured that any resulting catalytic activity would be attributable solely to DNA. 

Substrate recognition appears to depend on two regions of base-pairing interactions 
between the catalyst and substrate. However, the unpaired substrate nucleotides, 
5'-GGA-3\ that lie between these two regions may play an important role in substrate 
recognition, metal coordination, or other aspects of catalytic function. 

20 It is further anticipated that an all-RNA molecule, other RNA-DNA composites, 

and molecules containing one or more nucleotide analogs may be acceptable substrates. 
As disclosed herein, the within-described in vitro evolution procedures may successfully 
be used to generate enzymatic DNA molecules having the desired specificities; further 
analyses along these lines are presently underway. 

25 In addition, studies to determine whether the presumed base-pairing interactions 

between enzyme and substrate are generalizabte with respect to sequence are in 
progress, using the presently-described methods. The within-disclosed Pb 2 * -dependent 
deoxyribozymes may also be considered model compounds for exploring the structural 
and enzymatic properties o1 DNA. 

30 The methods employed in the present disclosure for the rapid development of 

DNA catalysts will have considerable generality, allowing us to utilize other cofactors to 
trigger the cleavage of a target linkage attached to a potential catalytic domain. In this 
regard, the development of Mg 2+ -dependent DNA enzymes that specifically cleave 
target RNAs under physiological conditions is of interest, as is the development of DNA 

35 enzymes that function in the presence of other cations (see Example 4). Such 

molecules will provide an alternative to traditional antisense and ribozyme approaches 
for the specific inactivation of target mRNAs. 
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DNA thus joins RNA and protein on the list of biological macromolecules that are 
capable of exhibiting enzymatic activity. The full extent of DNA's catalytic abilities 
remains to be explored, but these explorations should proceed rapidly based on in vitro 
selection methods such as those employed in this study. 
5 DNA enzymes offer several important advantages compared to other 

macromolecular catalysts. First, they are easy to prepare, in an era when most 
laboratories have access to an automated ONA synthesizer and the cost of DNA 
phosphoramidites has become quite modest. Second, they are very stable compounds, 
especially compared to RNA, thus facilitating their use in biophysical studies. Third, we 
1 0 expect that they can be adapted to therapeutic applications that at present make use of 

.. antisense DNAs that lack RNA-cleavage activity. In vitro selection could be carried out 
with DNA analogs, including compounds that are nuclease resistant such as 
phosphorothioate-containing DNA, so long as these analogs can be prepared in the form 
of a deoxynucleoside 5Mriphosphate and are accepted as a substrate by a 
1 5 DNA-dependent DNA polymerase. Finally, DNA enzymes offer a new window on our 

understanding of the macromolecular basis of catalytic function. It will be interesting, 
for example, to carry out comparative analyses of protein-, RNA-, and DNA-based 
enzymes that catalyze the same chemical transformation. 

Example 4 

20 pthftr Familie i p« r-xtalvtic DNAs 

A starting pool of DNA was prepared by PCR essentially as described in Example 
2.B. above, except that the starting pool of DNA comprised molecules containing 40 
random nucleotides. Thus, the starting pool of DNA described herein was prepared by 
PCR using the synthetic oligomer 5 ' GGG ACQ AAT TCT AAT ACQ ACT CAC TAT rA 

25 GG AAG AGA TGG CGA CAT CTC N W GT GAC GGT AAG CTT GGC AC 3 • ISEQ ID NO 

23). where N is en equimolar mixture of G, A, T and C. and where the DNA molecules 
were selected for the ability to cleave the phosphoester following the target rA. (See 

Figure 6A. also.) ^ 
Selective amplification was carried out in the presence of either Pb^.Zn^.Mn 
30 or Mg J \ thereby generating at least four -families" of catalytic DNA molecules. As 

illustrated in Figure 5, catalytic DNA molecules demonstrating specific activity were 
generated in the presence of a variety of cations. 

Figure 5 is a photographic representation showing a polyacrylamide gel 
demonstrating specific endoribonuclease activity of four families of seiected catalytic 
35 DNAs. Selection of a Pb" -dependent family of molecules was repeated in a side-by- 

side fashion as a control. In each group of three lanes, the first lane shows the lack of 
activity of the selected populetion in the absence of the metal cation, the second lane 
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shows the observed activity in the presence of the metal cation, and the third lane - 
shows the lack of activity of the starting pool (GO). At present, the order of reactivity is 
observed to be Pb 2 * > Zn 2 * > Mn 2 * > Mg 2 + , mirroring the pK, of the corresponding metal- 
hydroxide. 

After either five (G5) or six {G6} rounds of selective amplification in the presence 
of the preselected divalent cation, the desired endonuclease activity was obtained. The 
following description of selective amplification in the presence of Mg 2 + is intended to be 
exemplary. 

Six rounds of in vitro selective amplification were carried out, following the 
method described in Example 2 hereinabove, except that the divalent metal used was 1 
mM Mg 2 * rather than 1 mM Pb 2 \ (See also Breaker and Joyce, Chem. & Rinl. y 
223-229 (1994), incorporated by reference herein, which describes essentially the same 
procedure.) 

Individual clones were isolated following the sixth round, and the nucleotide 
sequence of 24 of these clones was determined. All of the sequences began with: 5 ' 
GGG ACG AAT TCT AAT ACG ACT CAC TAT rA GG AAG AGA TGG CGA CA (SEQ ID 
NO 23 from position 1 to 44} and ended with: CGG TAA GCT TGG CAC 3' (SEQ ID 
NO 23 from position 93 to 107). 

The segment in the middle, corresponding to TCTC N 40 GTGA (SEQ ID NO 23 
from position 45 to 92) in the starting pool, varied as follows: 

|1 3) CCG CCC ACC TCT TTT ACG AGC CTG TAC GAA ATA GTG CTC TTG 

TTA GTA T (SEQ IO NO 24) 
<5) TCT CTT CAG CGA TGC ACG C TT GTT TTA ATG TTG CAC CCA TfiT 

IAG TGA (SEQ ID NO 25) 
(2) TCT CAT CAG CGA TTG AAC CAC TTG GTG GAC AGA CCC ATG TTA 

GTG A (SEQ ID NO 26) 
(1 ) CCG CCC ACC TCT TTT ACG AGC CTG TAC GAA ATA GTG TTC TTG 

TTA GTA T (SEQ ID NO 27) 
(1 ) CCG CCC ACC TCT TTT ACG AGC CTG TAC GAA ATA GTG CTC TCG 

TTA GTA T (SEQ ID NO 28) 
(1 ) TCT CAG ACT TAG TCC ATC ACA CTC TGT GCA TAT GCC TGC TTG 

ATG TGA (SEQ ID NO 29) 
(1 i -CT CTC ATC TGC TAG CAC GCT CGA ATA GTG TCA GTC GAT GTG A 

(SEQ ID NO 30). 
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The initial number in parentheses indicates the number of clones having that 
particular sequence. Note that some mutations (highlighted in bold type} occurred at 
nucleotide positions other than those that were randomized initially. 

The second sequence listed above (i.e., SEQ ID NO 25), which occurred in 5 of 
5 24 clones, was chosen as a lead (i.e. principal) compound for further study. Its 

cleavage activity was measured in the presence of a 1 mM concentration of various 
divalent metals and 1 M NaCI at pH 7.0 and 23*C: 

metal K», [ntin*] 

10 none n.d. 

Mg 2+ 2-3 x TO' 3 

Mn 2 * 6.8 x 10- 3 

Zn 2+ 4.2 x10' 2 

Pb J * 1.1 x 10 J 

15 

Thus, the lead compound is active in the presence of all four divalent metals, 
even though it was selected for activity in the presence of Mg 2 *. Conversely, DNA 
molecules that were selected for activity in the presence of Mn 2 *, Zn 2 \ or Pb 2 * did not 
show any activity in the presence of Mg 2 *. 

20 In addition, the population of DNAs obtained after six rounds of in vitro selection 

in the presence of Mg 2 *, when prepared as all-phosphorothioate-containing DNA 
analogs, showed Mg 2 '-dependent cleavage activity at an observed rate of - TO' 3 min\ 
The phosphorothioate-containing analogs were prepared enzymatically so as to have an 
/?p configuration at each stereocenter. Such compounds are relatively resistant to 

25 degradation by cellular nucleases compared to unmodified DNA. 

The lead compound was re-randomized at 40 nucleotide positions (underlined), 
introducing mutations at a frequency of 15% (5% probability of each of the three 
possible base substitutions). The re-randomized population was subjected to seven 
additional rounds of in vitro selection. During the last four rounds, molecules that were 

30 reactive in the presence of 1 mM Pb 2+ were removed from the population before the 

remainder were challenged to react in the presence of 1 mM Mg 2 *. Individual clones 
were isolated following the seventh round and the nucleotide sequence of 14 of these 
clones was determined. All of the sequences began with: 5 ' GGG ACG AAT TCT AAT 
ACG ACT CAC TAT rA GG AAG AGA TGG CGA CAT CTC (SEQ ID NO 23. from position 

35 1 to 48), and ended with: GTG ACG GTA AGC TTG GCA C 3 ' (SEQ ID NO 23, from 

position 89 to 107). 

The segment in the middle, corresponding to the 40 partially-randomized 
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positions (N^, SEQ ID NO 23, from position 49 to 88), varied as follows: 



10 



15 



2C 



(4) TAC AGC GAT TCA CCC TTG TTT AAG GGT TAG ACC CAT GTT A 
(SEQ ID NO 31) 

(2) ATC AGC GAT TAA CGC TTG TTT CAA TGT TAC ACC CAT GTT A 
(SEQ ID NO 32) 

(2) TTC AGC GAT TAA CGC TTA TTT TAG CGT TAC ACC CAT GTT A 
{SEQ ID NO 33) 

( 1 ) ATC AGC GAT TCA CCC TTG TTT TAA GGT TGC ACC CAT GTT A 
(SEQ ID NO 34) 

( 1 ) ATC AGC GAT TCA CCC TTG TTT AAG CGT TAC ACC CAT GTT G 
(SEQ ID NO 35) 

( 1 ) ATC AGC GAT TCA CCC TTG TTT TAA GGT TAC ACC CAT GTT A 
(SEQ ID NO 36) 

( 1 ) ATC AGC GAT TAA CGC TTA TTT TAG CGT TAC ACC CAT GTT A 
(S€Q ID~N© 37) 

(1 ) ATC AGC GAT TAA CGC TTG TTT TAG TGT TGC ACC CAT GTT A 
(SEQ ID NO 38) 

(1 ) ATC AGC GAT TAA CGC TTA TTT TAG CAT TAC ACC CAT GTT A 
(SEQ ID NO 39). 



The number in parentheses indicates the number of clones having that particular 
sequence. Nucleotides shown in bold are those that differ compared to the lead 
compound. 

25 Formal analysis of the cleavage activity of these clones is ongoing. The 

population as a whole exhibits Mg 2+ -dependent cleavage activity at an observed rate of 
— 10" a min*\ with a comparable level of activity in the presence of Pb 2 + . 

Figures 6A and 6B provide two-dimensional illustrations of a "progenitor" 
catalytic-DNA molecule and one of several catalytic DNA molecules obtained via the 

30 selective amplification methods disclosed herein, respectively. Figure 6A illustrates an 

exemplary molecule from the starting pool, showing the overall configuration of the 
molecules represented by SEQ ID NO 23. As illustrated, various complementary 
nucleotides flank the random (N^) region* 

Figure 6B is a diagrammatic representation of one of the Mg a+ -dependent 

35 catalytic DNA molecules (or "DNAzymes") generated via the within-described 

procedures. The location of the ribonucleotide in the substrate nucleic acid is indicated 
via the arrow. (The illustrated molecule includes the sequence identified herein as SEQ 
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ID NO 25, as wall as "beginning" and "ending" sequences ol SEQ ID NO 23.) 

Endonuclease activity is continuing to be enhanced in each of the 
aforementioned "families" via in vitro evolution, as disclosed herein, so it is anticipated 
that enzymatic DNA molecules ol increasingly desirable specificities may be generated 
5 successfully using the within-disclosed guidelines. 

Example 5 
^Ravage of | ?rnsr RNA Sf-nuences 
As an extension of the foregoing, we have developed DNA enzymes that cleave 
an all-RNA substrate, rather than a single ribonucleotide embedded within an otherwise 
1 0 all-DNA substrate as demonstrated above. (Also see R.R. Breaker & G.F. Joyce, Omul 

X, Bioi, 1 ; 223-229 (1994): R.R. Breaker & G.F. Joyce, Chim & Biol. 2: 655-660 
(1995)). As a target sequence, we chose a stretch of 12 highly-conserved nucleotides 
within the U5 LTR region of HIV-1 RNA, having the sequence 
5' GUAACUAGAGAU 3' (SEQ ID NO 49). 
! 5 Following the methods described in the previous examples, we generated a pool 

of 1014 DNA molecules that have the following composition: 

5'- GGAAAA r(GUAACUAGAGAU) GGAAGAGATGGCGAC N so 
CGGTAAGCTTGGCAC -3' (SEQ ID NO 50I, 
where N is an equimolar mixture of the deoxyribonucleotides G, A, T, and C. and where 
20 the sequence identified as 'r(GUAACUAGAGAU) * is comprised of ribonucleotides. 

(Optionally, one may alter the initial 5' nucleotide sequence, e.g., by adding an 
additional dA residue to the sequence preceding the ribonucleotide portion at the 5' end, 
thus causing the initial sequence to read 'GGAAAAA" and causing SEQ ID NO 50 to be 
99 residues in length. Clearly, this is but one example of the modifications that may be 
25 made in order to engineer specific enzymatic DNA molecules, as disclosed in detail 

herein.) 

The enzymatic DNA molecules thus produced were selected for their ability to 
cleave a phosphoestar that lies within the embedded RNA target sequence. Tan rounds 
of in vitro selective amplification were carried out. basetirwEtheaea^^sJai^ 
30 molecules' activity in the presence of 1 0 mM Mg»* at pH 7.5 and 37'C. During the 

' selection process, there was competition for "preferred" cleavage sites as well as for the 
"best" catalyst that cleaves at each such preferred site. Two sites and two families of 
catalysts emerged as possessing the most efficient cleavage capabilities (see Fig. 7). 
Figure 7 illustrates some of the results of ten rounds of in vitro selective 
35 amplification carried out essentially as described herein. As shown, two sites and two 

families of catalysts emerged as displaying the most efficient cleavage of the target 
sequence. Cleavage conditions were essentially as indicated in Fig. 7, namely. 10mM 
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Mg 2 \ pH 7.5, and 37 # ; data collected after the reaction ran for 2 hours is shown. 
Cleavage (%) is shown plotted against the number of generations (here, 0 through 10). • 
The number/prevalence of catalytic DNA molecules capable of cleaving the target 
sequence at the indicated sites in the substrate is illustrated via the vertical bars, with 
5 cleavage at G IUAACUAGAGAU shown by the striped bars, and with cleavage at 

GUAACUAIGAGAU illustrated via the open (lightly-shaded) bars. In Figure 7, as herein, 
the arrow (1) indicates the site between two neighboring nucleotides at which cleavage 
occurs. 

Various individuals from the population obtained after the 8th and 10th rounds 

10 of selective amplification were cloned. The nucleotide sequences of 29 individuals from 

the 8th round and 32 individuals from the 10th round were then determined (see Tables 
2 and 3, respectively). 

Under the heading "Nucleotide Sequence" in each of Tables 2 and 3 is shown 
the portion of each identified clone that corresponds to the 50 nucleotides that were 

1 5 randomized in the starting pool (i.e., N 50 ); thus, the entire- nucleotide sequence of a 

given clone generally includes the nucleotide sequences preceding, following-, and 
including the "N so " segment, presuming the substrate sequence is attached and that 
self-cleavage has not occurred. For example, the entire sequence of a (non-self-cleaved) 
clone may generally comprise residue nos. 1-33 of SEQ ID NO 50, followed by the 

20 residues representing the randomized N 50 region, followed by residue nos. 84-98 of SEO 

ID NO 50, or by residue nos. 1-34 of SEQ ID NO 51, followed by the residues 
representing the randomized N 50 region, followed by residue nos. 85-99 of SEQ ID NO 
51. It is believed, however, that the N 50 (or N^) region - or a portion thereof - of each 
clone is particularly important in determining the specificity and/or activity of a particular 

25 enzymatic DNA molecule. This is particularly evident in reactions in which the substrate 

and the DNAzyme are separate molecules (see, e.g.. Figs. 8 and 9). 

Clone numbers are designated as 8-x or 10-x for individuals obtained after the 
8th or 10th rounds, respectively. SEQ ID NOS are also listed and correspond to the 
"N 50 " region of each clone. 
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15 



30 



Table 2 

Cloned Individuals from 8th Round of Amplification 



Clone SEQ /e , 

no, id NO ' N " I Nucleotide Sequence L5 -3J - 

8 .2 52 CCA ATA GTG CTA CTG TGT ATC TCA ATG CTG GAA ACA CGG GTT 
ATC TCC CG 

g.4 53 CCA AAA CAG TGG AGC ATT ATA TCT ACT CCA CAA AGA CCA CTT 
TTC TCC CG 

8-5' 54 ATC CGT. ACT AGC ATG CAG ACA GTC TGT CTG CTT TTT CAT TAC 
TCA CTC CC 

8 .1 4 55 CAA TTC ATG ATG ACC AAC TCT GTC AAC ACG CGA ACT TTT AAC 
ACT GGC A 

8-1 V 56 CTT CCA CCT TCC GAG CCG GAC GAA GTT ACT TTT TAT CAC ACT 
ACG TAT TG 

GGC AAG AGA TGG CAT ATA TTC AGG TAA CTG TGG AGA TAC CCT 
GTC TGC CA 

CTA GAC CAT TCA CGT TTA CCA AGC TAT GGT AAG AAC TAG AAT 



8-3 57 
8-6 58 
20 8-8 59 



CAC GCG TA 

CGT ACA CGT GGA AAA GCT ATA AGT CAA GTT CTC ATC ATG TAC 
CTG ACC GC 

8-10 60 CAG TGA TAC ATG AGT GCA CCG CTA CGA CTA AGT CTG TAA CTT 
ATT CTA CC 

8-22 61 ACC GAA TTA AAC TAC CGA ATA GTG TGG TTT CTA TGC TTC TTC 
25 TTC CCT GA 

8-1 1 62 CAG GTA GAT ATA ATG CGT CAC CGT GCT TAC ACT CGT TTT ATT 

AGT ATG TC 

8-21 63 CCC TAC AAC ACC ACT GGG CCC AAT TAG ATT AAC GCT ATT TTA 
TAA CTC G 

8-12 64 CCA AAC GGT TAT AAG ACT GAA AAC TCA ATC AAT AGC CCA ATC 
CTC GCC C 

8.1 3 65 CAC ATG TAT ACC TAA GAA ATT GGT CCC GTA GAC GTC ACA GAC 
TTA CGC CA 

8-23 66 CAC AAC GAA AAC AAT CTT CCT TGG CAT ACT GGG GAG AAA GTC 
35 TGT TGT CC 

8-40 67 CAC ACG AAC ATG TCC ATT AAA TGG CAT TCC GTT TTT CGT TCT 
ACA TAT GC 
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8-24 68 CAG AAC GAG GGT CTT GTA AGA CTA CAC CTC CTC AGT GAC AAT 
AAT CCT G 

8-26 69 CAC TAC AGC CTG ATA TAT ATG A AG AAC AGG CAA CAA GCT TAT 
GCA CTG G 

5 8-27 70 GGG TAC ATT TAT GAT TCT CTT ATA AAG AGA ATA TCG TAC TCT 

TTT CCC CA 

8-28 7 1 CCA AAG TAC ATT CCA ACC CCT TAT ACG TGA AAC TTC CAG TAG 
TTT CCT A 

8-29 72 CTT GAA GAT CCT CAT AAG ACG ATT AAA CAA TCC ACT GGA TAT 
10 AATCCGGA 

8-34 73 CGA ATA GTG TCC ATG ATT ACA CCA ATA ACT GCC TGC CTA TGA 
TGT TTA TG 

8-35 74 CCA AGA GAG TAT CGG ATA CAC TTG GAA CAT AGC TAA CTC GAA 
CTG TAC CA 

1 5 8-36 75 CCA CTG ATA AAT AGG TAA CTG TCT CAT ATC TGC CAA TCA TAT 

GCC GTA 

8-37 76 CCC AAA TTA TAA ACA ATT TAA CAC AAG CAA AAG GAG GTT CAT 
TGC TCC GC 

8-39 77 CAA TAA ACT GGT GCT AAA CCT AAT ACC TTG TAT CCA AGT TAT 
20 CCT CCC CC 

1 identical to 10-4, 10-40 

2 identical to 8-20, 8-32, 8-38, 10-1, 10-34; 1 mutation to 10-1 1; 3 mutations 
25 to 10-29 

Table 3 

Cloned Individuals from 10th Round of Amplification 

30 Clone SEQ 

No. id no IIW Nucleotide Sequence (5'-3') 

1 0-3 3 78 CCG AAT GAC ATC CGT AGT GGA ACC TTG CTT TTG ACA CTA AGA 
AGC TAC AC 

10-10 79 CCA TAA CAA ATA CCA TAG TAA AGA TCT GCA TTA TAT TAT ATC 
35 GGT CCA CC 

10-12 80 CAG AAC AAA GAT CAG TAG CTA AAC ATA TGG TAC AAA CAT ACC 
ATC TCG CA 

10-14 81 CCT TTA GTT AGG CTA GCT ACA ACG ATT TTT CCC TGC TTG GCA 
ACG ACA C 
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10-1 5 82 CTCCCT ACG TTA CAC CAG CGG TAG GAA TTT TCC ACG AGA GGT 
AAT CCG CA 

10-19 83 CGG CAC CTC TAG TTA GAC ACT CCG GAA TTT TTC CCC 
, 0-39 84 CGG CAC CTC TAG TTA GAC ACT CCG GAA TTT TAG CCT ACC ATA 
GTC CGG T 

, 0-23 85 CCC TTT GGT TAG GCT AGC TAC AAC GAT TTT TCC CTG CTT GAA 
TTG TA 

, 0 -27< 86 CCC TTT GGT TAG GCT AGC TAC AAC GAT TTT TCC CTG CTT GAC 
CTG TTA CGA 

,0-31 87 CCT TTA GTT AGG CTA GCT ACA ACG ATT TTT CCC TGC TTG GAA 
CGA CAC 

,0-, 8 88 CAT GGC TTA ATC ATC CTC AAT AGA AGA CTA CAA GTC GAA TAT 
GTC CCC CC 

10 .20 89 CAA CAG AGC GAG TAT CAC CCC CTG TCA ATA GTC GTA TGA AAC 

1 5 ATT GGG CC 

10 -6 90 TAC CGA CAA GGG GAA TTA AAA GCT AGC TGG TTA TGC AAC CCT 

TTT CGC A 

n0 -7 91 CTC GAA ACA GTG ATA TTC TGA ACA AAC GGG TAC TAC GTG TTC 
AGC CCC C 

,0-8 92 CCA ATA ACG TAA CCC GGT TAG ATA AGC ACT TAG CTA AGA TGT 

TTA TCC TG 

,0-ia 93 CAA TAC AAT CGG TAC GAA TCC AGA AAC ATA ACG TTG TTT CAG 

AAT GGT CC ^ K „n 

,0-2, 94 GCA ACA ACA AGA ACC AAG TTA CAT ACA CGT TCA TCT ATA CTG 



10 



20 



oc AAC CCC CA 

,0-24 95 CCT TTG AGT TCC TAA ATG CCG CAC GGT AAG CTT GGC ACA CTT 

TGA CTG TA 

,0-28 96 CAA AGA TCT CAC TTT GGA AAT GCG AAA TAT GTA TAT TCG CCC 

TGTCTGX f*>~r 
30 1043 97 CCA CGT AGA ATT ATC TGA TTT ATA ACA TAA CGC AGG ATA ACT 

CTCGCCCA 

,0.35 98 CAC AAG AAA GTG TCG TCT CCA GAT ATT TGA GTA CAA GGA ACT 

,0-36 99 CAT GAA GAA ATA GGA CAT TCT ACA GGC TGG ACC GTT ACT ATG 
CCT GTA GG 

,0-37 100 CAT AGG ATA ATC ATG GCG ATG CTT ATG ACG TGT ACA TCT ATA 
CCTT 
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10-38 101 CAG ATG ATC TTC CTT TAA AGA CTA CCC TTT AAA GAA ACA TAA 
GGT ACC CC 



3 1 mutation to 10-5 

4 1 mutation to 10-30 



The self-cleavage activity of various clones was subsequently measured. Clones 

10 8-5, 8-17, and 10-3 were found to cleave efficiently at the site 5' GUAACUl AGAGAU 

3*, while clones 10-14, 10-19 and 10-27 were found to cleave efficiently at the site 5' 
GIUAACUAGAGAU 3'. When the RNA portion of the molecule was extended to the 
sequence 5' GGAAAAAGUAACUAGAGAUGGAAG 3' (residue nos. 1-24 of SEQ ID NO 
51). clones 8-17, 10-14, and 10-27 retained full activity, while clones 8-5, 10-3, and 

15 10-19 showed diminished activity. Subsequently, clone 10-23 was found to exhibit a 

high level of activity in the -self-cleavage reaction involving the extended RNA domain. 

It should also be noted, in the event one of skill in the relevant art does not 
appreciate same, that the nucleotide sequences preceding and following the "N 50 " 
segments of the polynucleotide molecules engineered according to the teachings of the 

20 present invention disclosure may be altered in a variety of ways in order to generate 

enzymatic DNA molecules of particular specificities. For example, while residue nos. 1- 
24 of SEQ ID NO 51 are described herein as RNA nucleotides, they may alternatively 
comprise DNA, RNA, or composites thereof. (Thus, for example, SEQ ID NO 51 could 
easily be altered so that nucleic acid residue nos. 1-7 would comprise DNA, residue nos! 

25 8-19 would comprise RNA, residue nos. 20-99 would comprise DNA, and so on.) 

Similarly, the nucleotides following the "U M n region may comprise RNA, DNA, or 
composites thereof. The length of the regions preceding and following the "N,,," (or 
"N^" - see Example 4) region(s) may also be varied, as disclosed herein. Further, 
sequences preceding and/or following N eo or N^ regions may be shortened, expanded, 

30 or deleted in their entirety. 

Moreover, as noted above, we selected a specific region of HIV-1 RNA as the 
target sequence in the methods described in this Example; such a sequence is not the 
only sequence one may use as a target. Clearly, one of skill in the relevant art may 
follow our teachings herein to engineer and design enzymatic DNA molecules with 

35 specificity for other target sequences. As disclosed herein, such target sequences may 

be constructed or inserted into larger sequences comprising DNA, RNA, or composites 
thereof, as illustrated by SEQ ID NOS 50 and 51. 

The self-cleavage reaction was easily converted to an intermolecular cleavage 
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reaction by dividing the enzyme and substrate domains into separate molecules. Clones 
8-17 and 10-23 were chosen as prototype molecules. Both were shown to act as DNA 
enzymes in the cleavage of a separate all-RNA substrate in a reaction that proceeds with 
multiple turnover (Fig. 8). The substrate binding arms were subsequently reduced to 7 
5 base-pairs on each side of the unpaired nucleotide that demarcates the cleavage site 

(Fig. 9). 

Figure 8 illustrates the nucleotide sequences, cleavage sites, and turnover rates 
of two catalytic DNA molecules of the present invention, clones 8-17 and 10-23. 
Reaction conditions were as shown, namely, 10mM Mg 2 *. pH 7.5, and 37 W C The 

1 0 DNAzyme identified as clone 8-1 7 is illustrated on the left, with the site of cleavage of 

the RNA substrate indicated by the arrow. The substrate sequence (5' - 
GGAAAAAGUAACUAGAGAUGGAAG - 3'} - which is separate from the DNAzyme (i.e., 
intermolecular cleavage is shown) ~ is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 

1 5 substrate indicated by the arrow. Again, the substrate sequence is indicated. For the 8- 

17 enzyme, the turnover rate was approximately 0.6 hr 1 ; for the 10-23 enzyme, the 
turnover rate was approximately 1 hr'\ 

As illustrated in Fig. 8, the nucleotide sequence of the clone 8-17 catalytic DNA 
molecule capable of cleaving a separate substrate molecule was as follows: 

20 B'-CTTCCACCTTCCGAGCCGGACGAAGTTACTTTTT-S' (residue nos. 1-34 of SEQ ID 

NO 56). In that same figure, the nucleotide sequence of the clone 10-23 catalytic DNA 
molecule capable of cleaving a separate substrate molecule was as follows: 
5' -CTTTGGTTAGGCTAGCTACAACGATTTTTCC-3' (residue nos. 3-33 of SEQ ID NO 

85). 

25 Figure 9 further illustrates the nucleotide sequences, cleavage sites, and 

turnover rates of two catalytic DNA molecules of the present invention, clones 8-17 and 
10-23. Reaction conditions were as shown, namely, 10mM Mg 2 \ pH 7.5, and 37 W C. 
As in Fig. 8, the DNAzyme identified as clone 8-17 is illustrated on the left, with the site 
of cleavage of the RNA substrate indicated by the=ancowv TOe^fasWl^^entse f5' - 

30 GGAAAAAGUAACUAGAGAUGGAAG - 3*) --which is separate from the DNAzyme (i.e., 

intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence is indicated. For the 8- 
17 enzyme, k ob , was approximately 0.002 min <; for the 10-23 enzyme, the value of k o6l 

35 was approximately 0.01 min' 1 . 

As illustrated in Fig. 9, the nucleotide sequence of the clone 8-17 catalytic DNA 
molecule capable of cleaving a separate substrate molecule was as follows: 
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5' -CCACCTTCCGAGCCGGACGAAGTTACT-3' (residue nos. 4-30 of SEQ ID NO 56). In 
that same figure, the nucleotide sequence of the clone 10-23 catalytic DNA molecule 
capable of cleaving a separate substrate molecule was as follows: 
5' -CTAGTTAGGCTAGCTACAACGATTTTTCC-3' (residue nos. 5-33 of SEQ ID NO 85, 
with "CTA" substituted for 'TTG" at the 5' end). 

The catalytic rate of the RNA-cleaving DNA enzymes has yet to be fully 
optimized. As disclosed above and as reported in previous studies, we have been able 
to improve the catalytic rate by partially randomizing the prototype molecule and 
carrying out additional rounds of selective amplification. We have found, however, that 
the K m for Mg 2 * is approximately 5 mM and 2 mM for the 8-17 and 10-23 DNA 
enzymes, respectively, measured at pH 7.5 and 37*C; this is certainly compatible with 
intracellular conditions. 

The foregoing specification, including the specific embodiments and examples, is 
intended to be illustrative of the present inventron and is nor to be taken as limiting. 
Numerous other variations and modifications can be effected without departing from the 
true spirit and scope of the present invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(I) APPLICANT: The Scripps Research Institute 

(ii) TITLE OF INVENTION: ENZYMATIC DNA MOLECULES 

(iii) NUMBER OF SEQUENCES: 101 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: The Scripps Research Institute 

(B) STREET: 1066 6 North Torrey Pines Road, TPC-8 

(C) CITY : La Jolla 

(D) STATE: California 

(E) COUNTRY: United States 
<F) ZIP: 92037 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1. 

(vi) CURRENT APPLICATION DATA: 

' (A) APPLICATION NUMBER: PCT/US95/ 

(B) FILING DATE: 01-DEC-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/472,194 

(B) FILING DATE: 07-JUN-199S 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/349,023 

(B) FILING DATE: 02-DEC-1994 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Logan, April C. 

(B) REGISTRATION NUMBER: 33,950 

(C) REFERENCE /DOCKET NUMBER: 4 63.2 PC 
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(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (619) 554-2937 

(B) TELEFAX: (619) 554-6312 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 15 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CGGTAAGCTT GGCAC 15 
20 (2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (ix) FEATURE : 

(A) NAME /KEY: misc_dif f erence 

(B) LOCATION: replace (8, " " } 

(D) OTHER INFORMATION : /s tandard_name» "ADENOSINE 
RIBONUCLEOTIDE" 

35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
TCACTATNAG GAAGAGATGG 20 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: £EQ ID NO : 3 : 

ACACATCTCT GAAGTAGCGC CGCCGTATAG TGACGCTA 38 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
{AJ LENGTH: 80 base pairs 
(B } TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GTGCCAAGCT TACCGNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 60 

80 

NNNNNGTCGC CATCTCTTCC 
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(2) INFORMATION FOR SEQ ID NO:5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



10 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 28 

15 (D) OTHER INFORMATION: /standard_name= M 2'3' CYCLIC 

PHOSPHATE" 

(ix) FEATURE: 

(A) NAME/ KEY : tnisc_dif f erence 
20 (B) LOCATION: replace (28, 

(D) OTHER INFORMATION: /standard_name= -ADENOSINE 
RIBONUCLEOTIDE" 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GGGACGAATT CTAATACGAC TCACTATN 28 



30 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 

(B) LOCATION; replace (28, "") 

(D) OTHER INFORMATION: /standard_name= "ADENOSINE 
RIBONUCLEOTIDE " 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:6: 

GGGACGAATT CTAATACGAC TCACTATN 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 

(B) LOCATION: replace (8, 

(D} OTHER INFORMATION: /standard_name« "ADENOSINE 
RIBONUCLEOTIDE " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

TCACTATNGG AAGAGATGG 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 

(B) LOCATION: replace (8, 

(D) OTHER INFORMATION: /standard_name= "ADENOSINE 
NUCLEOTIDE " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TCACTATN 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 9: 
CCATCTCTTC CTATAGTGAG TCCGGCTGCA 
(2) INFORMATION" FOR SEQ' ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGCCAAGCT TACCG 15 
5 (2) INFORMATION FOR SEQ ID NO ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CTGCAGAATT CTAATACGAC TCACTATAGG AAGAGATGGC GAC 4 3 

(2) INFORMATION FOR SEQ ID NO : 12 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE : 
30 (A) NAME /KEY : misc_dif f erence 

(B) LOCATION: replace ( 8 , " " ) 

(D) OTHER INFORMATION: /standard_name= "ADENOSINE 
RIBONUCLEOTIDE M 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 



TCACTATNGG AAGAGATGG 



19 
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(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 
(BJ LOCATION: replace (28, »") 

(D) OTHER INFORMATION; /standard_name= "ADENOSINE 

RIBONUCLEOTIDE " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
GGGACGAATT CTAATACGAC TCACTATNGG AAGAGATGGC GAC 43 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
TCACACATCT CTGAAGTAGC GCCGCCGTAT GTGACGCTAG GGGTTCGCCT 50 
(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : ■ single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

10 GGGGGGAACG CCGTAACAAG CTCTGAACTA GCGGTTGCGA TATAGTCGTA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(R) TYP.E.: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CGGGACTCCG TAGCCCATTG CTTTTTGCAG CGTCAACGAA TAGCGTATTA^ 
(2) INFORMATION FOR SEQ ID N0:17: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
30 {B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCACCATGTC TTCTCGAGCC GAACCGATAG TTACGTCATA CCTCCCGTAT 



50 



50 



50 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:18: 
GCCAGATTGC TGCTACCAGC GGTACGAAAT AGTGAAGTGT TCGTGACTAT 5( 
•2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATAGGC CATG CTTTGGCTAG CGGCACCGTA TAGTGTACCT GCCCTTATCG SO 
(2) INFORMATION FOR SEQ ID NO: 20: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCTGCTCTCC TCTATTCTAG CAGTGCAGCG AAATATGTCG AATAGTCGGT 50 
5 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21> 

TTGCCCAGCA TAGTCGGCAG ACGTGGTGTT AGCGACACGA TAGGCCCGGT 5 0 

(2) INFORMATION FOR SEQ ID NO:22: 

20 

(i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH; 5 0 base pairs , 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION ; SEQ ID NO:22: 

TTGCTAGCTC GGCTGAACTT CTGTAGCGCA ACCGAAATAG TGAGGCTTGA SO 

(2) INFORMATION FOR SEQ ID NO: 23: 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_dif f erence 

(B) LOCATION : replace (28, ■») 

(D) OTHER INFORMATION : /s tandard_name= "ADENOSINE 
RIBONUCLEOTIDE " 
/label= rA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GGGACGAATT CTAATACGAC TCACTATNGG AAGAGATGGC GACATCTCNN NNNNNNNNNN 60 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNGT GACGGTAAGC* TTGGCAC 107 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGCCCACCT CTTTTACGAG CCTGTACGAA ATAGTGCTCT TGTTAGTAT 4 9 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TCTCTTCAGC GATGCACGCT TGTTTTAATG TTGCACCCAT GTTAGTGA 4 8 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs . 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCTCATCAGC GATTGAACCA CTTGGTGGAC AGACCCATGT TAGTGA 4 6 

(2) INFORMATION FOR SEQ ID NO:27: 



20 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 
35 CCGCCCACCT CTTTTACGAG CCTGTACGAA ATAGTGTTCT TGTTAGTAT 49 



(2) INFORMATION FOR SEQ ID NO:28: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2B: 

CCGCCCACCT CTTTTACGAG CCTGTACGAA ATAGTGCTCT CGTTAGTAT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 8 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

TCTCAGACTT AGTCCATCAC ACTCTGTGCA TATGCCTGCT TGATGTGA 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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CTCTCATCTG CTAGCACGCT CGAATAGTGT CAGTCGATGT GA 42 
(2) INFORMATION FOR SEQ ID NO: 31: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 
(BJ TYPE ; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

15 TACAGCGATT CACCCTTGTT TAAGGGTTAC ACCCATGTTA 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATCAGCGATT AAC GCTTGTT TCAATGTTAC ACCCATGTTA 
(2) INFORMATION FOR SEQ ID NO:33: 



30 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



40 
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(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

5 TTCAGCGATT AACGCTTATT TTAGCGTTAC ACCCATGTTA 4 0 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
ATCAGCGATT CACCCTTGTT TTAAGGTTGC ACCCATGTTA 4 0 

(2) INFORMATION FOR SEQ ID NO: 35: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 0 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3S: 
ATCAGCGATT CACCCTTGTT TAAGCGTTAC ACCCATGTTG 40 
35 (2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



10 ATCAGCGATT CACCCTTGTT TTAAGGTTAC ACCCATGTTA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
ATCAGCGATT AACG CTTATT TTAGCGTTAC ACCCATGTTA 

25 

(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
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ATCAGCGATT AACGCTTGTT TTAGTGTTGC ACCCATGTTA 40 
(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 
ATCAGCGATT AACGCTTATT TTAGCATTAC ACCCATGTTA 40 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GCCATGCTTT 1 0 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



WO 96/17086 



PCTYUS95/15580 



-73- 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CTCTATTTCT 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE : nucleic acid 

(C) STHANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
TATGTGACGC TA 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
TATAGTCGTA 

(2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS: 



WO 96/17086 PCT/US95/15580 

-74- 

(Al LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) , STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

10 ATAGCGTATT A 3- 1 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATAGTTACGT CAT 13 
(2) INFORMATION FOR SEQ ID NO: 46: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



35 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.46: 
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14 



AATAGTGAAG TGTT 



(2) INFORMATION FOR SEQ ID NO: 47: 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 



(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 



11 



15 



ATAGGCCCGG T 



14 



AATAGTGAGG CTTG 



30 . 



(2) INFORMATION FOR SEQ ID NO: 49: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: RNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 

5 {xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GUAACUAGAG AU 12 

(2) INFORMATION FOR SEQ ID NO: 50: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
20 (ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 7. ,18 

(D) OTHER INFORMATION : /note- -Position 7-18 is RNA; the 

remainder of the sequence is DNA. " 



25 



30 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 50: 

GGAAAAGUAA CUAGAGAUGG AAGAGATGGC GACNNNNNNN NNNNNNNNNN NNNNNNNNNN 6 0 
NNNNNNNNNN NNNNNNNNNN NNNCGGTAAG CTTGGCAC 98 

(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCTAJS95/15580 

WO 96/17086 



-77- 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(ix) FEATURE: 
5 (A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION : /note= -Positions 1-24 is RNA; the 
remainder of the sequence is DNA. " 

-|0 Cxi) SEQUENCE DESCRIPTION : SEQ ID NO: SI: 

GGAAAAAGUA ACUAGAGAUG GAAGAGATGG CGACNNNNNN NNNNNNNNNN NNNNNNNNNN 60 
NNNNNNNNNN NNNNNNNNNN NNNNCGGTAA GCTTGGCAC 99 

15 (2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL : NO 
25 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CCAATAGTGC TACTG TGTAT CTCAATGCTG GAAACACGGG TTKTCTCCCG 

30 

(2) INFORMATION FOR SEQ ID NO: S3: 



SO 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
35 (b) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



10 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: S3: 



CCAAAACAGT GGAGCATTAT ATC TACT CCA CAAAGACCAC TTTTCTCCCG 50 



{2} INFORMATION. FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
ATCCGTACTA GCATGCAGAC AGTCTGTCTG CTTTTTCATT ACTCACTCCC 50 
25 (2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucieirc acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic)' 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

35 (Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 55: 



CAATTCATGA TGACCAACTC TGTCAACACG CGAACTTTTA ACACTGGCA 
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(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:S6: 

15 CTTCCACCTT CCGAGCCGGA CGAAGTTACT TTTTATCACA CTACGTATTG SO 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA- (genomic) 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE : NO 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
GGCAAGAGAT GG CAT ATATT CAGGTAACTG TGGAGATACC CTGTCTGCCA 50 
(2) INFORMATION FOR SEQ ID NO: 58: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

10 CTAGACCATT CACGTTTACC AAGCTATGGT AAGAACTAGA. ATCACGCGTA 50 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CGTACACGTG GAAAAGCTAT AAGTCAAGTT CTCATCATGT ACCTGACCGC • 50 

(2) INFORMATION FOR SEQ ID NO: 60: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

5 CAGTGATACA TGAGTGCACC GCTACGACTA AGTCTGTAAC TTATTCTACC 5 0 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
ACCGAATTAA ACTACCGAAT AGTGTGGTTT CTATGCTTCT TCTTCCCTGA 50 
(2) INFORMATION FOR SEQ ID NO: 62: 

25 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

35 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



CAGGTAGATA TAATGCGTCA CCGTGCTTAC ACTCGTTTTA TTAGTATGTC 



50 
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(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 49 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

15 CCCTACAACA CCACTGGGCC CAATTAGATT AACG CTATTT TATAACTCG 4 9 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH : 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CCAAACGGTT ATAAGACTGA AAACTCAATC AATAGCCCAA TCCTCGCCC 4 9 

(2) INFORMATION FOR SEQ ID NO: 65: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE; NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CACATGTATA CCTAAGAAAT TGGTCCCGTA GACGTCACAG ACTTACGCCA 50 
(2) INFORMATION FOR SEQ ID NO: 66; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CACAACGAAA ACAATCTTCC. TTGGCATACT GGGGAGAAAG TCTGTTGTCC 50 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE : NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

5 CACACGAACA TGTCCATTAA ATGGCATTCC GTTTTTCGTT CTACATATGC SO 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 4 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

.15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
CAGAACGAGG GTCTTGTAAG ACTACACCTC CTCAGTGACA ATAATCCTG 4 9 

(2) INFORMATION FOR SEQ ID NO: 69: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CACTACAGCC TGATATATAT GAAGAACAGG CAACAAGCTT ATGCACTGG 



49 
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(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



SO 



MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
, 5 GGGTACATTT ATGATTCTCT TATAAAGAGA ATATCGTACT CTTTTCCCCA 

(2) INFORMATION FOR SEQ ID NO:71: 

(i) SEQUENCE CHARACTERISTICS: 
2Q ( A ) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:71: 

30 ^n^rrrr TTATACGTGA AACTTCCAGT AGTTTCCTA 

CCAAAGTACA TTCCAACCCC TTATAU^i 

(2) INFORMATION FOR SEQ ID NO: 72: 



35 



(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pair 

(B) TYPE: nucleic acid 
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( C ) STRAND EDNES S : 3 ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 72 : 

10 CTTGAAGATC CTCATAAGAC GATTAAACAA TCCACTGGAT ATAATCCGGA 5 0 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
CGAATAGTGT CCATGATTAC AC CAATAACT GCCTGCCTAT CATGTTTATG 50 
(2) INFORMATION FOR SEQ ID NO: 74: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

5 CCAAGAGAGT ATCGGATACA CTTGGAACAT AGCTAACTCG AACTGTACCA 

(2) INFORMATION FOR SEQ ID NO; 75: 

(i) SEQUENCE CHARACTERISTICS: 
<|Q (A) LENGTH: 4 8 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



20 



CCACTGATAA ATAGGTAACT GTCTCATATC TGCCAATCAT ATGCCGTA 



48 



(2) INFORMATION FOR SEQ ID NO: 76: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



CCCAAATTAT AAACAATTTA ACACAAGCAA AAGGAGGTTC ATTGCTCCGC 



50 
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(2) INFORMATION FOR SEQ ID NO; 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

15 CAATAAACTG GTGCTAAACC TAATACCTTG TATCCAAGTT ATCCTCCCCC 50 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CCGAATGACA TCCGTAGTGG AACCTTGCTT TTGACACTAA GAAGCTACAC SO 
(2) INFORMATION FOR SEQ ID NO: 79: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genamic) 
(iii) HYPOTHETICAL: NO 

(ivj ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
Z CAT AAC AAA T AC CAT AGTA AAGATCTGCA TTATATTATA TCGGTCCACC 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
CAGAACAAAG ATCAGTAGCT AAACATATGG TACAAACATA CCATCTCGCA 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CCTTTAGTTA GGCTAGCTAC AACGATTTTT CCCTGCTTGG CAACGACAC 4 9 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
CTCCCTACGT TACACCAGCG GTACGAATTT TCCACGAGAG GTAATCCGCA 50 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



CGGCACCTCT AGTTAGACAC TCCGGAATTT TTCCCC 
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(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

1 5 CGGCACCTCT AGTTAGACAC TCCGGAATTT TAGCCTACCA TAGTCCGGT 

(2) INFORMATION FOR SEQ ID NO: 85: 

<i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH : 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii> MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
CCCTTTGGTT AGGCTAGCTA CAACGATTTT TCCCTGCTTG AATTGTA 
(2) INFORMATION FOR SEQ ID NO: 86: 



47 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B ) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

10 CCCTTTGGTT AGGCTAGCTA CAACGATTTT TCCCTGCTTG ACCTGTTACG A 51 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 {ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE : NO 



25 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



CCTTTAGTTA GGCTAGCTAC AACGATTTTT CCCTGCTTGG AACGACAC 48 



(2) INFORMATION FOR SEQ ID NO: 88: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
CATGGCTTAA TCATCCTCAA TAGAAGACTA CAAGTCGAAT ATGTCCCCCC 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
CAACAGAGCG AGTATCACCC CCTGTCAATA GTCGTATGAA ACATTGGGCC 
(2) INFORMATION FOR SEQ ID NO: 90: 

( i ) SEQUENCE' CHARACTERISTICS : 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



TACCGACAAG GGGAATTAAA AGCTAGCTGG TTATGCAACC CTTTTCGCA 
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(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 49 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 

fxi) SEQUENCE DESCRIPTION : SEQ ID NO: 91: 

15 CTCGAAACAG TGATATTCTG AACAAACGGG TACTACGTGT TCAGCCCCC 4 9 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 
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(xi) SEQUENCE" "DESCRIPTION : SEQ ID NO:92: 
CCAATAACGT AACCCGGTTA GATAAGCACT TAGCTAAGAT GTTTATCCTG 50 
(2) INFORMATION FOR SEQ ID NO: 93: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
CAATACAATC GGTACGAATC CAGAAACATA ACGTTGTTTC AGAATGGTCC 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GCAACAACAA GAACCAAGTT ACATACACGT TCATCTATAC TGAACCCCCA 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:9s": 
CCTTTGAGTT CCTAAATGCC GCACGGTAAG CTTGGCACAC TTTGACTGTA 5 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
CAAAGATCTC ACTTTGGAAA TGCGAAATAT GTATATTCGC CCTGTCTGC 4 5 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
C CACGT AGAA TTATCTGATT TATAACATAA CGCAGGATAA CTCTCGCCCA 50 
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(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 
CAGAAGAAAG TGTCGTCTCC AGATATTTGA GTACAAGGAA CTACGCCC 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
CATGAAGAAA TAGGACATTC TACAGGCTGG ACCGTTACTA TGCCTGTAGG 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL : NO 

<iv) ANTI-SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 

10 CATAGGATAA TCATGGCGAT GCTTATGACG TGTACATCTA TACCTT 4 6 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
CAGATGATCT TCCTTTAAAG ACTACCCTTT . AAAG AAAC AT AAGGTACCCC 



50 
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We Claim: 

1. A catalytic DNA molecule having site-specific endonuciease activity. 

2. The catalytic DNA molecule of claim 1, wherein said endonuciease 
activity is specific for a nucleotide sequence defining a cleavage site comprising single- 

5 stranded nucleic acid in a substrate nucleic acid sequence. 

3. The catalytic DNA molecule of claim 2, wherein said single stranded 
nucleic acid comprises RNA, DNA, modified RNA, modified DNA, nucleotide analogs, or 

composites thereof. 

4. The catalytic DNA molecule of claim 2, wherein said substrate nucleic 
10 acid comprises RNA, DNA, modified RNA, modified DNA, nucleotide analogs, or 

composites thereof. 

5. The catalytic DNA molecule of claim 2. wherein said endonuciease 
activity comprises hydrolytic c.eavage of a phosphoester bond at said cleavage site. 

6. The catalytic DNA molecule of claim 1 . wherein said molecule is single- 

15 stranded. 

7. The catalytic DNA molecule of claim 1 , wherein.said molecule includes 

one or more hairpin loop structures. 

8. The catalytic DNA molecule of claim 1 , wherein said substrate nucleic 
acid sequence is attached to said catalytic DNA molecule. 

20 9 . The catalytic DNA molecule of claim 1. wherein said substrate nucleic 

acid sequence is not attached to said catalytic DNA molecule. 

10. The catalytic DNA molecule of claim 1 . wherein said catalytic DNA 
molecule comprises a nucleotide sequence selected from the group consisting of: 
SEQ ID NO 3 and SEQ ID NOS 14 through 22. 
25 , , . The catalytic DNA molecule of claim 1 . wherein said catalytic DNA 

molecule comprises a nucleotide sequence selected from the group consisting of: 
SEQ ID NOS 23 through 30. 

1 2. The catalytic DNA molecule of claim 1 , wherein said catalytic DNA 
molecule c=mprises-a.nucleotide=sex l uenoe,-se.e C ted^.om the- group consisting of: 

30 SEQ ID NOS 31 through 39. 

! 3 . The catalytic DNA molecule of claim 1 . wherein said catalytic DNA 
molecule comprises a nucleotide sequence selected from the group consisting of: 
SEQ ID NOS 52 through 101 . 

1 4. The catalytic DNA molecule of claim 1 1 , 1 2, or 1 3. wherein said 
35 endonuciease activity is enhanced by the presence of Mg 2 '. 

! 5. The catalytic DNA molecule of claim 1 . wherein said catalytic DNA 
molecule has a substrate binding affinity of about 1 »M or less. 
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16. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule binds substrate with a K 0 of less than about 0.1 /yM. 

17. The catalytic DNA molecule of claim 2, wherein said nucleotide 
sequence defining said cleavage site comprises at least one nucleotide. 

5 18. The catalytic DNA molecule of claim 1, wherein said endonuclease 

activity is enhanced by the presence of a divalent cation. 

19. The catalytic DNA molecule of claim 18, wherein said divalent cation is 
selected from the group consisting of Pb 2 + , Mg 2 \ Mn 2 \ Zn 2 \ and Ca 2 \ 

20. The catalytic DNA molecule of claim 1 , wherein said endonuclease 
10 activity is enhanced by the presence of a monovalent cation. 

21 . The catalytic DNA molecule of claim 20, wherein said monovalent cation 
is selected from the group consisting of Na + and K*. 

22. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule comprises a conserved core flanked by first and second substrate binding 

1 5 rergions. 

23. The catalytic DNA molecule of claim 22, further comprising one or more 
spacer nucleotides between said conserved core and said substrate binding region. 

24. The catalytic DNA molecule of claim 22, wherein said conserved core 
comprises one or more conserved regions. 

20 25. The catalytic DNA molecule of claim 24, wherein said one or more 

conserved regions includes a nucleotide sequence selected from the group consisting of: 

CG; 

CGA; 

AGCG; 
25 AGCCG; 

CAGCGAT; 

CTTGTTT; and 

CTTATTT. 

26. The catalytic DNA molecule of claim 24, further comprising one or more 
30 variable or spacer nucleotides between said conserved regions in said conserved core. 

27. The catalytic DNA molecule of claim 22. wherein said first substrate 
binding region includes a nucleotide sequence selected from the group consisting of: 

CATCTCT; 
GCTCT; 

35 TTGCTTTTT; 

TGTCTTCTC; 
TTGCTGCT; 
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GCCATGCTTT; 
CTCTATTTCT 
GTCGGCA; 
CATCTCTTC; and 
5 ACTTCT. 

28. The catalytic DNA molecule of claim 22. wherein said second substrate 
binding region includes a nucleotide sequence selected from the group consisting of: 

TATGTGACGCTA; 

TATAGTCGTA; 
1 0 ATAGCGTATTA; 

ATAGTTACGTCAT; 

AATAGTGAAGTGTT; 

TATAGTGTA; 

ATAGTCGGT; 
1 5 ATAGGCCCGGT; 

AATAGTGAGGCTTG; and 

ATGNTG. 

29. The catalytic DNA molecule of claim 22. further comprising a third 
substrate binding region, wherein said third region includes a nucleotide sequence 

20 selected from the group consisting of: 

TGTT; 
TGTTA; and 
TGTTAG. 

30. The catalytic DNA molecule of claim 29, further comprising one or more 
25 spacer regions between said substrate binding regions. 

31 . A composition comprising two or more populations of catalytic DNA 
molecules according to claim 1, wherein each population of catalytic DNA molecules is 
capable of cleaving a different nucleotide sequence in a substrata. 

32. A composition comprising two or more populations of catalytic DNA 
30 molecules according to claim 1 , wherein each population of catalytic DNA molecules is 

capable of recognizing a different substrate. 

33. A method of selecting a catalytic DNA molecule that cleaves a substrate 
nucleic acid sequence at a specific site, comprising the following steps: 

a. obtaining a population of single-stranded DNA molecules; 
35 b . admixing nucleotide-containing substrate molecules with said population 

of single-stranded DNA molecules to form an admixture; 
c. maintaining said admixture for a sufficient period of time and under 
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predetermined reaction conditions to allow single-stranded DNA molecules in 
said population to cause cleavage of said substrate sequences, thereby 
producing substrate cleavage products; 

d. separating said population of single-stranded DNA molecules from said 
5 substrate sequences and substrate cleavage products; and 

e. isolating single-stranded DNA molecules that cleave nucleotide- 
containing substrate at a specific site from said population. 

34. ' The method of claim 33, wherein said substrate comprises RNA. 

35. The method of claim 33, wherein said DNA molecules that cleave said 
10 substrate at a specific site are tagged with an immobilizing agent. 

36. The method of claim 35, wherein said agent comprises biotin. 

37. The method of claim 35, wherein said isolating step further comprises 
exposing said tagged DNA molecules to a solid surface having avidin linked thereto, 
whereby said tagged DNA molecules become attached to said solid surface. 

1 5 38. A method of cleaving a phosphoester bond, comprising: 

a. admixing a catalytic DNA molecule capable of cleaving a substrate 
nucleic acid sequence at a defined cleavage site with a phosphoester bond- 
containing substrate, to form a reaction admixture; and 

b. maintaining said admixture under predetermined reaction conditions to 
20 allow said catalytic DNA molecule to cleave said phosphoester bond, thereby 

producing a population of substrate products. 

39. The method of claim 38, further comprising the steps of 

a. separating said products from said catalytic DNA molecule; and 

b. adding additional substrate to said catalytic DNA molecule to form a new 
25 reaction admixture. 

40. The method of claim 38, wherein said substrate comprises RNA. 

41. The method of claim 38, wherein said predetermined reaction conditions 
include the presence of a monovalent cation, a divalent cation, or both. 

42. A method of engineering- catalytic DNA fc mojeouies^ that cleave 
30 phosphoester bonds, comprising the following steps: 

a. obtaining a population of single-stranded DNA molecules; 

b. introducing genetic variation into said population to produce a variant 
population; 

c. selecting individuals from said variant population that meet 
35 predetermined selection criteria; 

d. separating said selected individuals from the remainder of said variant 
population; and 
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e. amplifying said selected individuals. 

43. A non-naturally-occurring catalytic DNA molecule comprising a 
nucleotide sequence defining a conserved core flanked by one or more recognition 
domains, variable regions, and spacer regions. 
5 44. The catalytic DNA molecule of claim 43, wherein said nucleotide 

sequence defines a first variable region contiguous or adjacent to the 5--terminus of the 
molecule, a first recognition domain located 3'-terminal to the first variable region, a first 
spacer region located 3'-terminal to the first recognition domain, a first conserved region 
located 3'-terminal to the first spacer region, a second spacer region located 3'-terminal 
1 0 to the first conserved region, a second conserved region located 3--terminal to the 

second spacer region, a second recognition domain located 3'-terminal to the second 
conserved region, and a second variable region located 3'-terminal to the second 

recognition domain. 

45. The catalytic DNA molecule of claim 43, wherein said nucleotide 

1 5 sequence defines a first variable region contiguous or adjacent to the 5'-terminus of the 

molecule, a first recognition domain located S'-terminal to the first variable region, a first 
spacer region located 3'-terminel to the first recognition domain, a first conserved region 
located 3'-terminal to the first spacer region, a second spacer region located Spermine! 
to the first conserved region, a second conserved region located 3--terminal to the 

20 second spacer region, a second recognition domain located 3'-terminal to the second 

conserved region, a second variable region located 3'-terminal to the second recognition 
domain, and a third recognition domain located S'-terminal to the second variable region. 
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